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A toxic legacy 


Illegal dumping of toxic waste in the Italian Campania has been blamed for high rates of ill health in 
the region. The suspected link needs to be investigated using the most modern methods. 


believes the tabloid press. From the air we breathe to the food we 
eat and the fun we have, the risks of normal, everyday life have 
never received more attention. 

But the people who live in the fertile agricultural landscape of Italy’s 
Campania region, around the historic city of Naples, have genuine 
reason to be anxious about their health. For decades they have lived 
on top of potentially lethal toxic waste, illegally and secretly dumped 
there by the mafia. Rates of some cancers are higher, and lifespans 
shorter, than elsewhere in Italy. Complaints, though, have been 
continually silenced. 

But Italian environmental police, working closely with anti-mafia 
forces, have used smart technologies to discover the subterranean 
dumps and are now systematically excavating them. Environmental 
officials are analysing soil and water. The crater walls look like geo- 
logical strata, each layer formed by different types of waste: asbestos 
from demolished buildings, dioxin-rich chemical sludge, drums of 
solvent, the odd motor vehicle. Black water forms pools at the bottom. 
No one is allowed to even look into the stinking, steaming pits without 
a respirator mask. 

Massive street demonstrations in Naples last autumn prompted 
officials to take action. The national government approved funding 
earlier this year for a two-year programme in which the several hun- 
dred thousand people who live close to the dumps will be screened for 
cancers that have been linked to environmental exposure. The health 
ministry has analysed all the health and environment studies carried 
out so far and concluded that there is no evidence yet to link the dumps 
to cancer. Yet local people are convinced that noxious run-off from 
the dumps is gradually killing them and their families, loading their 
bodies with toxic chemicals and twisting their cells into tumours. They 
want answers, and they want the scientists to supply them. 


I is tempting to see health hazards everywhere, especially if one 


TOXIC BLACK BOX 
Environmental police have so far identified 32 sites containing an 
estimated minimum of 3.5 million cubic metres of toxic waste. But 
without crucial information about actual exposure, including dose 
levels, it is impossible to determine whether the dumped chemicals 
have raised cancer risk in what is a poor region, where people smoke 
more and have unhealthier lifestyles than in other parts of the country. 

Similar issues arise whenever cancer clusters emerge around nuclear 
power plants or industrial sites. Attempts to prove a causal link face 
several dilemmas. One is that the number of cancer cases is usually 
too small for conclusive statistics. Another is that the cancers usually 
become apparent years after the hypothetical exposure to carcinogen, 
and such historical exposures are almost impossible to prove scientifi- 
cally. But usually, the identity of the feared chemicals is known. 

In Campania, the challenge is greater because the very fundamen- 
tals are unknown — not only the location and chemical content of the 


dumps, but also the true local incidence of cancer. Over the past year or 
so, that information has started to emerge. And biologists there are won- 
dering if they might be able to tie their whole noxious experience into the 
global research effort to develop methods to prove chemical exposure 
scientifically. Could their poisoned fields serve as a giant experiment in 
the new science of ‘exposomics, which aims to identify biomarkers of 
past and present exposure to toxic environmental chemicals? 
“Campania could be a perfect field study for a biomonitoring 
research programme,’ says Gennaro Ciliberto, scientific director of 
the National Tumour Institute Pascale in Naples. 
Precedents do exist. One is in the city of 


“Campania Thessaloniki in northern Greece, where 
could bea newly impoverished people have started 
perfect field to burn more biomass for home heating, 
study for contributing to smog. A research project 
biomonitoring funded by the European Union combines 


regular analyses of atmospheric pollutants 
with analyses of urine and blood of a cohort of 
people to quantitatively determine how their gene-expression, pro- 
tein and metabolite profiles change as a result of pollutants entering 
their bodies. 

Italy, in austerity mode, has little money to spare for research. So 
Ciliberto has a suggestion. “This is the sort of programme that should 
be funded from the structural funds that the European Commission 
has awarded the region” 

He could be right. These subsidies came to a total of €6.9 billion 
(US$9.6 billion) for Campania alone in 2007-13, and the commis- 
sion explicitly encourages use of the funds to boost local research 
and development capacity. Allocations of the next tranche of struc- 
tural funds within Italy (2014-20) are currently under discussion. 
Ciliberto’s idea deserves serious consideration. 

The ancient Romans called the area Campania felix, or fortunate 
countryside, because of the soil’s fertility (conferred by its frequent 
coating with volcanic ash from Mount Vesuvius’ deadly eruptions). 
But modern Italians call it Campania infelix, and its agriculture-based 
economy is suffering from public perception that its products could 
be contaminated. 

Anti-mafia action made the waste-disposal racket too public to 
continue after 2003. But mafia domination continued to stifle public 
discussion about the environmental poisoning. Locals, government 
officials and even many academics have been reluctant to talk. Some 
feared retribution, but more feared that they or their friends would be 
exposed as having helped in the logistics of the operations, or as having 
accepted mafia bribes for turning a blind eye. People began setting fire 
to the dumps, making matters worse. 

The fires are out now, but the questions of ill health remain. They 
should be properly investigated. Even if the results prove inconclusive, 
the information gained would be worth it. m 


research.” 
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A question of trust 


NASA’s decision to renege on SOFIA project 
casts doubts over its reliability as a partner. 


lion per night. That is essentially the advertisement just posted 

by NASA in a bid to save its Stratospheric Observatory for 
Infrared Astronomy (SOFIA) — a jumbo jet fitted with a 2.5-metre 
infrared telescope. The plea for cash comes in the wake of the 
space agency’s surprise announcement last month that it would no 
longer pay the roughly $80 million a year it had planned to spend 
on SOFIA. 

The cancellation was a shock because SOFIA had only just reached 
full operational capacity (see Nature 506, 420-421; 2014). After more 
than a decade in development, and a bill of $1.25 billion, the plane 
finally had a full suite of astronomical instruments in place. Plans 
were being laid for SOFIA to start increasing its flight schedule from 
its home in Palmdale, California, towards the goal of being airborne 
from dusk to dawn several nights a week, gathering science on key 
astronomical questions. Too little too late, NASA said — years of 
delays mean that SOFIA has missed its chance to overlap with other 
infrared space observatories and, the agency claimed, provide suf- 
ficient science return. 

But there is a deeper issue: NASA is not the only participant in 
SOFIA. From the project’s start, the German aerospace centre DLR 
has paid one-fifth of the cost, granting German scientists access to 
20% of its observing time. When NASA pulled out, the DLR was left 
hanging high and dry. 

What will happen next is far from certain, and will be dictated by 
complex financial negotiations between various branches of the US 
government. If NASA sticks to its plan, project managers must wind 
down all science flights and mothball SOFIA by 30 September, the 
end of the current fiscal year. But the $12 million currently budgeted 
is not enough to do even that job, according to a 15 April report from 
the Government Accountability Office. 


f or sale: one flying observatory, barely used. Price: US$1 mil- 


It is possible that the US Congress will restore some amount of 
money to the project, overriding NASA% request. Intense lobbying is 
now under way to sway the minds of key members of Congress, such 
as those who represent SOFIA workers in California. This may well 
succeed, granting the project some fraction of NASA% original budget, 
which could be enough to keep it limping along on a reduced schedule 
and with fewer staff. 

A third option is that some as-yet-unknown partner might emerge 
from the shadows to pick up some of NASA’s $80-million-a-year share. 

Hence the offer to rent it out. Nature would 


“NASA must be delighted to be proved wrong, but demand 
expect other for that seems unlikely. Germany, for one, has 
countries to already told NASA it cannot pay more than 
asksometough __ itisalready. 

questions before What happens to SOFIA has ramifications 
they sign up far beyond the world of infrared astronomy. 
to cooperative It has major implications for any country that 


might want to share scientific projects with 
NASA, and for the responsibility that NASA 
has to honour those agreements. The deal between NASA and the 
DLR contains a clause that allows one partner to propose shuttering 
the observatory. German officials and scientists were understandably 
furious at NASA’ unilateral decision to shut it down. 

During its long and chequered history, SOFIA has come close to 
being cancelled before. And there are many valid reasons to question 
its scientific return per dollar spent — or whether it should be flying 
at all (see Nature 466, 413; 2010). But what is happening with SOFIA 
is circumventing the usual process for making hard decisions in tight 
fiscal times. NASA has ‘decadal surveys’ to set priorities for future 
missions, and ‘senior reviews to check the worth of operational ones 
and to halt those that do not make the cut. 

For the moment, SOFIA continues to fly as normal. A NASA/DLR 
working group is outlining options for how to wind down the pro- 
gramme; its report is due by the end of April, and is likely to read like 
the order of service for a funeral. 

No matter whether Congress — or some benefactor — steps in 
to help, the breach of trust with the DLR is irrevocable. NASA must 
expect other countries to ask some tough questions before they sign 
up to cooperative projects in the future. m 


projects.” 


Take care 


The United States must tread carefully when 
building a health- data system. 


ith last year’s revelations about the National Security 
W eens intrusive spying practices, and the disastrous roll- 
out of the US health-insurance website HealthCare.gov, now 
may not seem the best time for the United States to establish a national 
network to collect and analyse health data from millions of patients. 
Good luck, then, to the government-backed Patient-Centered 
Outcomes Research Institute (PCORI), which is trying to do just that. 
In December, the institute in Washington DC launched the 
PCORnet project to collate medical records, physiological data and 
insurance claims from as many as 30 million US patients through 
29 web networks — a dream cohort for any biomedical researcher. 
Last month, the organization began building the computer infra- 
structure that will tie these records together, allowing researchers 
to spot trends in how lifestyles affect health, which drugs are most 
effective for certain types of individual, and so on. If it works as 
advertised, these findings could save millions of dollars by allowing 
researchers to recruit the right participants for clinical trials and 
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doctors to prescribe the right treatments for individuals. 

Designing a US$94-million programme that draws big data from 
dozens of computer systems is an enormous challenge — just ask 
former Health and Human Services secretary Kathleen Sebelius, 
who resigned last week after months of technical problems with 
HealthCare.gov. But the ethical issues may prove an even higher hur- 
dle. The United States would do well to watch the United Kingdom 
as it grapples with creating its own national database, care.data (see 
Nature 507, 7; 2014). Although the UK national health system is much 
more mature and cohesive than anything the United States has, the 
project's rollout has been delayed for months because of unanswered 
questions about informed consent and access to sensitive patient data. 

Even if PCORI can solve its technical and ethical problems, it will 
be some time before the project measures up to systems such as Den- 
mark’s MedCom, which collates comprehensive patient data going 
back to 1977 and provides an indispensable resource for researchers 
doing longitudinal studies. The system seems to work, largely because 
of its transparency about how the data are used. 

PCORI says that it will let patients help to determine research priori- 
ties, and promises that only anonymous data and no individual records 
will be available to researchers. It is crucial 
that the institute follows through, to maintain 
patients’ trust. Without that, neither researchers 
nor patients will benefit. As the old saying goes, 
you make your own luck. m 
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WORLD VIEW .ecninicos sen 


Technology prizes, for researchers in Islamic countries. The creation 

of the four US$500,000 prizes illustrates the growing importance to 
the Iranian leadership of domestic science and the nurturing of scientific 
cooperation and exchange with other nations. 

The Iranian government has already developed a cadre of distin- 
guished scientists. State-run media routinely vaunt advances made by 
Iranian researchers, often new military systems. Iran hosted the Fifth 
International Conference of Cognitive Science in Tehran last year. 
And Hessamaddin Arfaei, a physicist at the Institute for Research in 
Fundamental Sciences in Tehran, visited the United States in April 
2005 to deliver a lecture at a meeting of the American Physical Society 
and to continue his efforts “to establish interaction between Iranian 
scientists and the international scientific commu- 
nity”. His trip was presumably carried out with the 
blessing of Iranian officials. 

But these efforts are being subverted by other 
sectors of Iran’s ruling establishment, which are 
determined to ferret out ‘traitors’ among the coun- 
try’s most promising scientists. As Iran's interest in 
science and demand for international cooperation 
grows, researchers abroad have an opportunity to 
tip the balance of power away from these hardliners. 

Too often, scientists in Iran are victims ofa fantasy 
that there is a massive campaign orchestrated by the 
US government (including the CIA), Israel’s Mos- 
sad and the British intelligence services to launch a 
‘Velvet Revolution (Enghelab-e Makhmali) to under- 
mine the foundation of the Islamic Republic of Iran. 
To achieve their aim, the forces behind the conspir- 
acy are alleged to be recruiting high-profile Iranians 
through academic exchanges, by attracting them to attend conferences 
abroad or by inviting them to hold residency at Western institutions. 

Hardline Iranian newspapers and websites recite every known or 
imagined contact between Iranian scholars and their European and 
US colleagues, no matter how benign, including research grants and 
visiting fellowships at universities and private foundations. The con- 
spiracy about the Velvet Revolution, and the concomitant requirement 
to seek out and uncover ‘enemies’ — the higher profile the better — 
serves only the interests of the security agencies and the hardliners, who 
can justify the power and scope of the elaborate (and costly) security 
apparatus by claiming its necessity as a bulwark against those aiming 
to destroy the country. 

The physicist Omid Kokabee, for example, was arrested on spuri- 
ous charges of crimes against national security 


[= officials last month unveiled the Mustafa Science and 


in 2011 and given a ten-year prison sentence NATURE.COM 
in 2012. Kokabee, who was pursuinga PhD in _ Discuss this article 
quantum optics at the University of Texas at _ online at: 

Austin, had resisted repeated invitations from _go.latuire.com/sngks4 


SCIENTIFIC 
DIPLOMACY CAN 
HELP TO 


BUILD 
BRIDGES 


AND DEFUSE 
INTERNATIONAL 


TENSION. 


Iran needs to present a 
united front on science 


Iranian scientists must be allowed to interact abroad without fear of 
persecution, says Elise Auerbach. 


Iranian authorities to work on military and intelligence projects. His 
claimed crimes were communicating with a hostile government and 
receiving illegitimate funds — an apparent reference to the normal 
stipend given to graduate students at his university. 

His case and others like it must have a chilling effect on other Iranian 
scientists engaged in legitimate interactions with colleagues in the 
West. This inhibits the advancement of science in Iran and seems to 
be utterly counterproductive to the state-endorsed efforts to promote 
Iran’s scientific achievements internationally. 

The more moderate members of Iran’s ruling establishment surely 
understand that the country’s successful integration into the global 
community is undermined if its best and brightest scholars fear the 
consequences of engaging in normal interactions with colleagues 
abroad. The reality of Iran’s ‘brain drain’ problem 
was recognized by its science and technology minis- 
ter, Reza Faraji Dana, who noted in January: “Every 
year, about 150,000 of our elite emigrate from Iran, 
costing our economy $150 billion” 

The harsh penalties imposed on Kokabee and 
others have earned Iran the condemnation not only 
of the human-rights organizations campaigning on 
his behalf, but also of scientific and scholarly soci- 
eties — including the American Physical Society, 
with which the country is otherwise engaged in a 
dialogue to improve opportunities for collaboration. 

Scientific diplomacy can help to build bridges 
and defuse international tension. In an article titled 
“Which Iran Will We Choose?; published in Decem- 
ber by the The WorldPost, academics Trita Parsi, 
Bijan Khajehpour and Reza Marashi argued that 
sustained engagement with Iran on science projects 
“can spill over to other crucial areas, such as the human rights situation” 

A necessary precondition for such collaboration must be the right 
of scientists to engage in their normal professional activities without 
incurring persecution. Iran's scientists, its population and the broader 
scientific community would all benefit from Kokabee’s release. Cam- 
paigns to free Iranian scientists persecuted in this way have succeeded in 
the past. Brothers Arash and Kamiar Alaei, both HIV/AIDS physicians, 
were arrested in 2008 and convicted of “communicating with an enemy 
government”. They were released following international pressure. 

By offering the prospect of shared scientific enterprises on one hand, 
and advocating for an end to the persecution of scholars on the other, 
science diplomacy could help to convince decision-makers in Iran that 
the benefits of international scientific cooperation greatly outweigh any 
value in placating those who prefer to perpetuate a giant conspiracy. = 


Elise Auerbach is the Iran country specialist with Amnesty 
International USA in Chicago, Illinois. 
e-mail: aiis@uchicago.edu 
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RESEARCH HIGHLIGHTS 


Ancient impact 
recorded in rock 


South African rocks reveal 
that a huge asteroid smashed 
into Earth 3.26 billion years 
ago, turning sediments to 
liquid. 

The rocks in the Barberton 
greenstone belt in South 
Africa contain tiny blobs 
of material that condensed 
from clouds of rock vapour 
generated by ancient impacts. 
Norman Sleep and Donald 
Lowe of Stanford University 
in California used those blobs, 
along with other features of the 
Barberton rocks, to calculate 
the size and geological effects 
of this particular asteroid 
impact. 

The duo found that the 
asteroid was probably 
50 kilometres across. Seismic 
waves from the impact 
fractured much of Earth's 
crust, triggering strong 
earthquakes and giant 
tsunamis, the authors say. 
Geochem. Geophys. Geosys. 
http://doi.org/sb4 (2014) 


‘Extinct?’ plankton 
found in Pacific 


A plankton species thought to 
have gone extinct more than 
a million years ago has been 
found living in the western 
Pacific Ocean. 


Kenneth Mertens of Ghent 


Selections from the 
scientific literature 


ANTIMICROBIALS 


Biocide boosts bacterial binding 


People exposed to an antimicrobial compound 
used in many consumer products are more 
likely to carry an opportunistic pathogen, 
increasing their risk of infection. 

The bacterium Staphylococcus aureus 
(pictured in green) is normally harmless in 
healthy adults but can cause serious infections 
in sick or injured people. A team led by Blaise 
Boles at the University of Michigan in Ann 
Arbor found that people with higher levels of 
the biocide triclosan in their nasal secretions 


University in Belgium and 
his colleagues discovered live 
cysts of the dinoflagellate 
plankton Dapsilidinium 
pastielsii (pictured) in the 
sea floor from Japan to the 
Philippines and Indonesia. 
This stably warm region, 
known as the Indo-Pacific 
Warm Pool, harbours high 
levels of biodiversity. 

The warm waters there 
could have provided a haven 
for D. pastielsii to survive in the 
water column and sediment. 
Such areas of refuge will be 
important for maintaining 
biodiversity in a changing 
climate, the authors say. 
Geology http://doi.org/sb7 (2014) 
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had S. aureus colonies in their noses more 
frequently than people with little or no triclosan 
in their secretions. Triclosan, which is used in 


toothpastes, soaps and medical equipment, 


and keratin. 


Fruit-fly microbes 
draw more flies 


Researchers have pinpointed 
a chemical source that attracts 
fruit flies to food: the gut 
bacteria of fruit-fly larvae 
sitting on the food. 

Reuven Dukas and his 
colleagues at McMaster 
University in Hamilton, 
Canada, previously showed 
that fruit flies (Drosophila 
melanogaster) are attracted to 
food occupied by larvae. Now 
they find that the flies prefer 
food that has been inhabited 
by larvae with intact gut 
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increased the binding of S. aureus to plastic, 
glass and human proteins such as collagen 


The results suggest an urgent need to 
re-evaluate the use of triclosan in consumer 
products, the authors say. 
mBio 5, e01015-13 (2014) 


microbiomes rather than by 
bacteria-free larvae. 

The authors also show that 
this attraction is not related 
to the presence of gut bacteria 
or larvae in the food, but to 
the physical changes that the 
feeding larvae make. Used food 
is easier for alarva to burrow 
into than fresh food, they say. 
J. Exp. Biol. 217, 1346-1352 (2014) 


Earth twin spotted 
in habitable zone 


Astronomers have discovered 
a planet that is a similar size 
to Earth orbiting a cool, dim 


SPL 


YOSHIHITO TAKANO 


YOSHIZAWA ET AL./CURRENT BIOLOGY 


star at just the right distance 
for liquid water to exist. It is the 
most Earth-like planet found 
so far that could potentially 
host life. 

A team led by Elisa Quintana 
of the SETI Institute in 
Mountain View, California, 
discovered the planet, 
Kepler-186f, by looking at data 
from NASAs Kepler space 
telescope. Slight drops ina star’s 
brightness revealed the orbiting 
planet, the fifth known in that 
particular system. 

Kepler-186f is just 1.1 times 
the size of Earth and orbits 
its star every 130 days. It is 
towards the outer edge of the 
star’s habitable zone — in 
our Solar System, it would be 
equivalent to putting Earth out 
near the orbit of Mars. 

Science 344, 277-280 (2014) 
For a longer story on this research, 
see go.nature.com/tmbgje 


MICROBIAL EVOLUTION 


How a flesh-eater 
evolved 


A deadly wave of flesh-eating 
bacteria emerged in the 1980s 
when a single cell gained 

a pair of toxin genes from 
another bacterium. 

Infections with group A 
Streptococcus (GAS) strains, 
which cause necrotizing 
fasciitis, spiked worldwide in 
the late 1980s and early 1990s. 
To piece together the sequence 
of molecular events behind 
this epidemic, James Musser 
at the Houston Methodist 
Research Institute in Texas 
and his team sequenced the 
genomes of 3,615 GAS strains 
isolated from Europe and 
North America between 1969 
and 2013. 

Analysis of the genomes 
revealed a series of genetic 
changes that began to occur 
in the early 1970s. The final 
changes that took place before 
the strain became a voracious 
flesh-eater happened in 
around 1983, when less- 
virulent strains acquired 
the genes that are needed 
to produce certain toxins in 
humans. 

Proc. Natl Acad. Sci. USA 
http://doi.org/sdc (2014) 


PHYSICS 


Liquid bubbles 
stop sound 


Liquid foam can block the 
transmission of ultrasound, 
report Valentin Leroy at Paris 
Diderot University and his 
colleagues. 

The French team 
created the foam by 
bubbling an insoluble gas, 
perfluorohexane, into water 
containing a surfactant, and 
then sent ultrasound through 
the foam. 

The elastic, thin films at 
the edges of adjacent foam 
bubbles reflected the sound 
waves. The exact frequencies 
that were blocked varied with 
the average bubble size, the 
biggest of which measured up 
to 50 micrometres in radius. 

The findings show that 
foams can act as acoustic 
metamaterials — materials 
that block sound waves 
because of their unique 
structure. 

Phys. Rev. Lett. 112, 148307 
(2014) 


MICROBIOLOGY 


Bacteria elbow out 
the competition 


Some microorganisms push 
others out of the way to gain 
better access to oxygen. 

Wook Kim and Kevin 
Foster at the University of 
Oxford, UK, studied the soil 
bacterium Pseudomonas 
fluorescens, which grows in 
dense layers called biofilms. 

They found that a strain 
that consistently outgrew 
others and dominated the 
colony carried a mutation 
ina gene called rsmE. This 
mutant strain secretes 
polymers that expand, 
nudging neighbours aside 
and allowing cells to reach 
the colony surface where 
they can receive the most 
oxygen. 

Life in tightly packed 
communities has selected for 
this bacterial pushiness, the 
authors say. 

Proc. Natl Acad. Sci. USA 
http://doi.org/sb5 (2014) 
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SOCIAL SELECTION 


Popular articles 
on social media 


Illegal fishing hooks online attention 


Researchers on Twitter were astounded by some of the figures 
given in a paper describing extensive illegal fish imports into 
the United States. According to Tony Pitcher at the University 
of British Columbia in Vancouver and his colleagues, in 

2011 more than 20% of wild-caught seafood coming into the 
country — worth between US$1.3 billion and $2.1 billion 

— was from illegal or unreported fishing. To make their 
estimates, the researchers used data from the US National 
Marine Fisheries Service for 30 major seafood products from 
10 countries. Illegal, unreported and unregulated fishing 
disrupts ecosystems, food security and livelihoods around the 


world, the authors say. 


Mar. Policy 48, 102-113 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


ZOOLOGY 


Sex changed in 
Brazilian insect 


Female insects in Brazilian 
caves initiate reproduction, 
and they do so with a penis. 
Kazunori Yoshizawa at 
Hokkaido University in 
Sapporo, Japan, and his 
colleagues studied four 
Neotrogla species, one of 
which copulates for up to 
70 hours. The authors found 
that the female uses an 
external organ (pictured) 
during sex to penetrate the 
male and to receive sperm 
for both reproduction and 
nutrition. All of the species 
have spines on the penis-like 
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structure, which the females 

use to anchor themselves in 

the male genital chamber. 
The competition between 

females for mates might have 

driven the evolution of this 

elaborate sexual structure, 

the authors say. However, 

this is the only known case of 

reversed sex organs in animals 

with reversed sexual roles, 

they add. 

Curr. Biol. http://dx.doi. 

org/10.1016/j.cub.2014.03.022 

(2014) 

For a longer story on this research, 

see go.nature.com/ptnetb 
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SEVEN DAYS nescwnss 


NIH grant recycling 


The US National Institutes of 
Health is to allow researchers 
to base new grant applications 
on ideas that have previously 
been rejected for funding. 

The agency announced the 
policy on 17 April (go.nature. 
com/43cibf). It modifies a 
2009 rule that any completely 
new grant application must 

be substantially scientifically 
different from one that has 
been rejected. The change 

is in response to complaints 
that the 2009 policy penalized 
young researchers whose work 
cannot easily change direction 
to generate completely new 
proposals. 


Emissions down 

US emissions of six major 
heat-trapping gases decreased 
by 3.4% between 2011 and 
2012, to the equivalent of 
6,526 million tonnes of carbon 
dioxide, the US Environmental 
Protection Agency said on 

15 April. Energy consumption 
dropped in every sector of 

the US economy, the agency 
said, because more energy 

was generated from natural 
gas than from coal, among 
other factors. Emissions 

from transport also fell as 

fuel efficiency increased and 
demand for new vehicles 
declined. 


Environment boost 
Some 30 nations pledged 
US$4.43 billion on 16 April 
to help developing countries 
to protect the environment 
from degradation. The 
funding will be distributed 
over the next four years by 
the Global Environment 
Facility — an international 
partnership of governments, 
civil-society organizations 
and companies that supports 
sustainable development and 
environmental treaties such as 
the Convention on Biological 


I. rex comes to Washington 


A near-complete 66-million-year-old 
Tyrannosaurus rex skeleton (right femur 
pictured) arrived at the US National Museum 
of Natural History in Washington DC on 

15 April. It is on loan for 50 years from the US 
Army Corps of Engineers. The fossil is one of 
the largest and most complete T: rex specimens 
ever discovered, with up to 85% of the skeleton 


recovered after it was found on federal land 

in 1988. It will become the centrepiece of the 
museum's new 2,900-square-metre dinosaur and 
fossil hall, which is scheduled to open in 2019 
and will house examples from the museum's 
collection of 46 million fossils. The hall will be 
named after philanthropist David Koch, who 
donated US$35 million to the project. 


Diversity. The funding will 

go towards projects in more 
than 140 countries, aimed at 
tackling threats such as species 
extinction, toxic chemicals 
and deforestation. 


Cash for innovation 


The European Parliament 
voted on 15 April in favour 
ofa €22-billion (US$30- 
billion) programme to fund 
research and innovation 
partnerships between 
academia and industry. 

The Innovation Investment 
Package is funded by the 
European Union, industry 
and member states. Much of it 
will support six public-private 
partnerships, revamping the 
Joint Technology Initiatives 

of Europe's previous 
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multi-year research-funding 
programme. The package will 
provide competitive grants 

in innovative medicines, 
aeronautics, bio-based 
industries, fuel cells and 
hydrogen, and electronics 
components. Member states 
are expected to approve it in 
time for a proposed July launch. 


Pollution survey 

Just under one-sixth of China’s 
land is polluted, according 

to the first nationwide soil 
survey, released on 17 April by 
the Ministry of Environmental 
Protection and the Ministry 
of Land and Resources. Much 
of the pollution comes from 
cadmium, nickel and arsenic 
released by industry and 
agriculture, the analysis found. 
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The survey ran from April 
2005 to December 2013 and 
covered 6.3 million square 
kilometres — two-thirds of the 
nation’s land area. 


GM-labels law 


Vermont looks set to become 
the first US state to enact a 
law requiring the labelling of 
foods containing genetically 
modified (GM) ingredients. 
On 16 April, the state’s 

senate passed a bill in favour 
of labelling. The house of 
representatives is expected 

to agree to amendments to 
the bill, which has already 
passed through the house 

and senate once. Vermont's 
governor supports GM labels. 
Connecticut and Maine passed 
similar laws last year, but both 


CHIP SOMODEVILLA/GETTY IMAGES 


ESA 


SOURCE: BNEF 


contain clauses that stop them 
from coming into effect until 
neighbouring states follow suit. 


LADEE crashes 


NASAs Moon-dust probe 
shattered into clouds of dust 
when it hit the far side of the 
Moon, as planned, in a mission- 
ending impact on 18 April. 
The Lunar Atmosphere and 
Dust Environment Explorer 
(LADEE) finished its duties 
in March. Mission controllers 
directed the crash to protect 
historic lunar locations such 
as the Apollo landing sites 
from probe debris. During 

its lifetime, LADEE made 

the best measurements yet 

of lunar dust, generated as 
tiny meteorites bombard the 
Moon’s surface. See go.nature. 
com/mwgoln for more. 


Weapons update 
Syria has removed or destroyed 
around 80% of its chemical- 
weapons materials, a joint 
mission of the Organisation 
for the Prohibition of 
Chemical Weapons and the 
United Nations announced 
on 19 April. The remaining 
materials should be removed 
over the next few days, the 
mission says; all materials 
must be destroyed by 30 June. 
The process began in January 
under an agreement 
brokered by Russia and 

the United States. 


TREND WATCH 


Clean-energy analysts hope 
that a promising first quarter 


to 2014 foreshadow a rebound 
for dipping global investments. 
Figures released by Bloomberg 
New Energy Finance showed 
investment rising 10% on the 
same period last year, driven 
mainly by small-scale solar 
projects. There was a notable 
geographic shift too, with two 

of the largest deals occurring in 
Indonesia (US$1.6 billion for the 
Sarulla geothermal project), and 
Kenya ($860 million for the Lake 
Turkana wind project). 


Sentinel shots 


The European Space Agency 
published the first images 
from its Sentinel-14A satellite 
on 15 April. This image shows 
the city of Brussels and its 
surroundings. Green correlates 
to vegetation; pink to urban 
areas; white to high-density 
urban areas; and black to 
waterways and low-reflective 
areas such as airport runways. 
The image was captured by the 
satellite on 12 April, nine days 
after it launched. 


Egyptian satellite 


Egypt’s second remote- 
sensing satellite launched 

on 16 April ona mission to 
collect agricultural, geological 
and environmental data from 
northern Africa. The Russian- 
built EgyptSat-2 sailed into 
space aboard a Soyuz launch 
vehicle from the Baikonur 
Cosmodrome in Kazakhstan. 
The probe replaces 
EgyptSat-1, which lasted 


for three years after its 2007 
launch. Data from the second 
satellite will be handled by the 
Egyptian National Authority 
for Remote Sensing and Space 
Sciences near Cairo. 


| __BUSINESS 
SpaceX delivery 


SpaceX, the private spaceflight 
company in Hawthorne, 
California, made its third 
successful cargo delivery 

to the International Space 
Station on 20 April, as part of 
a US$1.6-billion contract with 
NASA to make a dozen drops 
to the space station by 2016. 
The flight was delayed from 
March because of technical 
difficulties. It carried science 
experiments and supplies 
including high-definition 
cameras for viewing Earth, 
and equipment to test why 

the immune system becomes 
suppressed in microgravity. 


Gene-cutter patent 
Anacclaimed method for 
editing DNA sequences 

has won its first patent. On 

15 April, the Broad Institute 

in Cambridge, Massachusetts, 
announced that it had patented 
an engineered CRISPR-Cas9 
system. The ‘clustered regularly 
interspaced short palindromic 
repeats’ process uses an 
engineered bacterial enzyme 

to target genes, making precise 
cuts in their DNA sequence 
and thereby enabling them to 
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Global investments reached US$47.7 billion in the first quarter of 2014. 
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SEVEN DAYS | THIS WEEK | 


29 APRIL 

The first solar eclipse of 
the year takes place. A 
bright ring of sunlight 
will surround the dark 
Moon in this annular 
eclipse, which will 

be best viewed from 
Antarctica. 
go.nature.com/q3dviq 


30 APRIL 

The World Health 
Organization 

publishes a report on 
global surveillance of 
antimicrobial resistance. 
It will present data on 
antibiotic resistance in 
nine bacteria responsible 
for common maladies 
such as diarrhoea and 
bloodstream infections. 


be modified. The system was 
devised by molecular biologist 
Feng Zhang at the Broad. Last 
November, Zhang co-founded 
a company, Editas Medicine in 
Cambridge, to develop ways of 
using the CRISPR technology 
for therapies (see go.nature. 
com/c8jopb). 


RNAi drugs battle 
The withdrawal of the 
pharmaceutical firm 

Novartis from RNA 
interference (RNAi)-based 
drug development does not 
spell the end for the field, a 
biotechnology executive says. 
RNAi-based drugs use snippets 
of RNA to shut down the 
production of disease-causing 
proteins. In an interview 

with Nature on 15 April, John 
Maraganore, chief executive 
of RNAi specialists Alnylam 
in Cambridge, Massachusetts, 
said that the announcement on 
14 April by Novartis of Basel, 
Switzerland, indicated that 
“large companies don't doa 
good job with the technology”. 
The field is actually thriving, 
he says. See page 443 for more. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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The drug Tamiflu is prescribed as the front-line treatment for serious cases of influenza. 


EPIDEMIOLOGY 


Tamiflu report 
comes under fire 


Conclusions on stockpiling of antiviral drugs challenged. 


BY DECLAN BUTLER 


study that calls into question the 
Avessiin of billions of dollars’ worth 

of antiviral drugs to mitigate the threat 
of influenza pandemics has been criticized by 
flu researchers. 

The analysis of Tamiflu and Relenza, drugs 
known as neuraminidase inhibitors, was pub- 
lished on 10 April by the Cochrane Collabo- 
ration’, a group that reviews the effectiveness 
of health-care measures. It concluded that the 
medicines were of little use. At the same time, 
the journal BMJ published a series of articles, 
including two that summarize the Cochrane 
findings””, and several editorials that focus on 
the five-year campaign by Cochrane and the 
BMJ to obtain the unpublished drug-company 
clinical-trial data later used in the review. 

The results “challenge the historical assump- 
tion that neuraminidase inhibitors are effective 


in combating influenza’, declared a joint BMJ- 
Cochrane news release on the findings. The 
drugs have had their “effectiveness overplayed, 
and harms underplayed’, said Fiona Godlee, 
the BM)’s editor-in-chief, at a press conference. 
The study generated worldwide media cover- 
age, including headlines labelling Tamiflu as 
“useless” and “ineffective”. 

But the review and its bottom line are vig- 
orously contested by many flu researchers. 
They argue that the analysis — an update by 
Cochrane — is based on randomized clinical 
trials (RCTs) of the drugs that lack sufficient 
statistical power to allow reliable conclusions 
to be drawn about the effects on flu complica- 
tions and hospitalizations. These are the key 
outcomes of interest during a flu pandemic. 

The critics say that the review also excluded 
many observational studies that have found the 
drugs to be helpful in normal clinical settings. 

Tamiflu is prescribed as the main treatment 
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for serious cases of flu, and researchers worry 
that the media storm risks undermining public 
confidence in this class of drug. “We risk los- 
ing one of the few weapons we have, because of 
overly negative publicity,’ says Peter Openshaw, 
director of the Centre for Respiratory Infection 
at Imperial College London. 

The review by Cochrane, a non-profit organ- 
ization in London, considered both Tamiflu 
(oseltamivir), produced by the drug company 
Roche in Basel, Switzerland, and Relenza 
(zanamivir), marketed by GlaxoSmithKline 
in Brentford, UK. It is one of the first analyses 
to review clinical-study reports — documents 
that companies submit to regulators and that 
contain fuller trial data than published RCTs. 

Both drugs have been found to shorten the 
duration of flu-like symptoms, and to be effec- 
tive in treating severe disease; what is in dispute 
is whether they reduce hospitalizations and 
severe complications. The Cochrane authors 
and the BMJ claim that there was no evidence 
for such gains. They also say that the findings 
bring into question the decision by govern- 
ments in the mid-2000s to stockpile the drugs 
against the threat of a pandemic of the H5N1 
avian-flu virus, which has a mortality rate of 
around 60%. Carl Heneghan of the University 
of Oxford, UK, a co-author of the review, said 
that there was “no credible way these drugs 
could prevent a pandemic” and that stockpiles 
were “money thrown down the drain”. 

But a dozen experts contacted by Nature’s 
News team said that the clinical-study reports, 
although they allow more detailed analyses, 
offer no substantial fresh findings. 

“The Cochrane authors have done a thor- 
ough review, and deserve credit for their efforts 
to obtain raw clinical data from Roche and 
regulatory authorities, but ultimately its find- 
ings are not surprising,” says Peter Horby, a flu 
researcher at the University of Oxford Clinical 
Research Unit in Hanoi, Vietnam. 

Jody Lanard and Peter Sandman, independ- 
ent risk-communication experts in New York, 
say that the press release on the review omits 
findings that in their opinion are key. For exam- 
ple, it rounds down Tamiflu’s reported 17-hour 
reduction in symptom duration in adults to 
“just half a day’, and describes the reduction 
as “small” It also does not report the 29-hour 
reduction in children. Lanard and Sandman 
claim that there has been “cherry-picking of the 
results to make them look worse for antivirals”. 

Horby’s view is that the BMJ-Cochrane > 
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> press release and public statements have 
contributed to media misinterpretation of 
what the study does and does not say. “The 
review does not state the drugs are ‘ineffec- 
tive’ or ‘useless,” as was reported in some 
media stories, he says. 

In a joint response to Nature, BMJ- 
Cochrane said that they felt the press release 
did “a good job of presenting the main 
findings” They defend its presentation of 
the reduction in duration of symptoms as 
justified, and note that the fuller outcomes 
were included in the review’s summary of its 
findings. The press coverage, they say, was 
“not bad in terms of balance and accuracy”. 

The two organizations agree that the 
limited statistical power of the trial data on 
the numbers of hospitalizations and com- 
plications makes it difficult to draw reliable 
conclusions on these. But they argue that 
their findings of no evidence for any effects 
challenge past analyses of trials that found 
that the drugs reduced complications and 
hospitalizations. They claim that the review 
did show new findings, for example data on 
side effects. 

RCTs are considered the gold standard for 
establishing the effectiveness of drugs, and 
Cochrane restricts itself to meta-analyses 
of these in its review. But critics note that 
these small clinical trials were carried out 
to gain regulatory approval for Tamiflu as 
treatment and prophylaxis for seasonal flu, 
which in most cases is mild. This meant that 
the healthy trial subjects rarely developed 
complications. In other words, the trials 
were not designed to test for the severe out- 
comes that are most relevant to pandemics. 

The critics add that observational studies 
of how large numbers of people respond to 
treatments under normal medical care can 
also provide important information on the 
effectiveness of drugs. In March, for exam- 
ple, an observational study’ of 30,000 people 
hospitalized during the 2009-10 swine-flu 
pandemic reported that neuraminidase 
inhibitors reduced mortality by 25%. 

But because of its policy of reviewing only 
RCTs, Cochrane did not include observa- 
tional data. “Given the limited number of 
RCTs, and the considerable evidence base on 
the effectiveness of neuraminidase inhibi- 
tors over the past ten years or more, it is diffi- 
cult to justify exclusion of the observational 
evidence,’ says Ben Cowling, a flu epidemi- 
ologist at the University of Hong Kong. 

BMjJ-Cochrane said they exclude all 
observational studies because they are “unre- 
liable for establishing treatment effects”. m 


1. Jefferson, T. et al. Neuraminidase Inhibitors for 
Preventing and Treating Influenza in Healthy 
Adults and Children (Review) (Wiley, 2014). 

2. Jefferson, T. et al. Br Med. J. 348, g2545 (2014). 

3. Heneghan, C. J. et al. Br Med. J. 348, g2547 
(2014). 

4. Muthuri, S. G. Lancet Resp. Med. http://dx.doi. 
org/10.1016/S2213-2600(14)70041-4 (2014). 


440 | NATURE | VOL 508 | 24 APRIL 20 


14 


bs 
be 


a 


People walk along a road damaged in the 1 April Chilean earthquake. 


SEISMOLOGY 


Chile quake defies 
expectations 


Smaller-than- expected tremor has scientists scrambling 
to redefine rules for areas of extreme seismic stress. 


BY ALEXANDRA WITZE 


onika Sobiesiak wasn't expecting 
Me morning of 2 April to start with 

such an adrenaline jolt. But as she 
scrolled through a list of earthquakes on her 
mobile phone, she saw that overnight a series 
of quakes had rocked the coast of northern 
Chile — almost exactly where she had installed 
a seismometer network a few years earlier. “I 
saw the 8.2,” says the geophysicist, who works 
at the University of Kiel in Germany, “and I 
rushed to get to my desk” 

That 1 April quake, which struck offshore 
near the village of Pisagua, was the largest in 
Chile since a magnitude-8.8 quake hit farther 
south in 2010. Although the Pisagua quake was 
not as big and not particularly damaging, it will 
still go down in the annals of seismology — as 
an intensively studied earthquake that upends 
some assumptions about how and when big 
quakes happen. 

In one sense, seismologists knew it was com- 
ing. Northern Chile, near the border with Peru, 
was the only stretch of the country’s coastline 
that had not broken in a large earthquake 
in the past century (see “Under pressure’). 
In 2006, expecting it to go, a German- 
French-Chilean collaboration blanketed the 
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region with seismometers, tiltmeters and other 
ground-measuring instruments, creating the 
Integrated Plate boundary Observatory Chile 
(IPOC). It captured the Pisagua quake in 
action, as did Sobiesiak’s network. 

But the earthquake was not the ‘Big One’ that 
seismologists had expected. Only a monstrous 
earthquake, of around magnitude 9, would 
have relieved all the geological stress built up 
in the region. More quakes, on the order of 

magnitude 8, are still 


“A lot of energy possible, but when 
remains to be they might strike 
releasedin is a mystery. More 


north Chile.” broadly, the Pisagua 
event has seismolo- 
gists rethinking some basic ideas about the risk 
of earthquakes in similar geological settings 
elsewhere — places with deep-diving crustal 
plates, such as Japan and Indonesia. 

Over time, earthquakes rupture particular 
portions of a long fault zone; the unbroken 
portions are ‘seismic gaps’ considered ripe for 
future quakes. Officials in these areas are often 
told to prepare for the worst-case scenario — 
the biggest possible earthquake in a given 
seismic gap. But the Pisagua quake shows that 
this does not always happen, says Susan Beck, 
a seismologist at the University of Arizona in 
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Tucson. Instead, it underscores that seismic 
gaps can rupture in all sorts of ways, from lots 
of smaller quakes to just a few big ones. 

Chile is an ideal laboratory in which to study 
such questions because it lies on the margin of 
a subduction zone, where the Nazca tectonic 
plate dives — or subducts — beneath the South 
American plate. Geological stress builds up 
and then is released in the occasional massive 
jolt. Chile is home to the largest earthquake 
ever recorded — one of magnitude 9.5 in 1960 
— and accounts for more than one-quarter of 
the planet’s total seismic-energy release. 

Pisagua had not seen a major earthquake 
since 1877, when a tremor of around mag- 
nitude 9 ripped through the area. Seismic 
activity began to pick up last August, when a 
swarm of small earthquakes struck the area. 
Another set followed over the new year, and a 
third cluster occurred in March. These three 
swarms seem to have prepared the subduction 
zone to rupture in the big 1 April quake, says 
Onno Oncken, a geophysicist at the German 
Research Centre for Geosciences in Potsdam. 

Until recently, researchers had thought that 
the next large earthquake in northern Chile 
would break the entire interface between 
the Nazca and South American plates, says 
Jean-Pierre Vilotte, a seismologist at the 
Paris Institute of Earth Physics. The Pisagua 
quake and a magnitude-7.6 aftershock two 
days later, “are a clear counterexample of this 


simplistic classification’, he says. Together they 
ruptured just a small portion of the entire 
region at risk. 

Intriguingly, the part of the subduction zone 
that broke was not the part that had built up 
the most stress, according to a ground-motion 
study of northern Chile's seismic gap by Mari- 
anne Métois, a geophysicist at the National 
Institute of Geophysics and Volcanology in 
Rome, and her colleagues (M. Métois et al. 
Geophys. J. Int. 194, 1283-1294; 2013). For 
some reason, the Pisagua quake released stress 
in areas that were not the most wound up. “A 
lot of energy remains to be released in north 
Chile,” Métois says. 

When the next one comes, seismologists 
plan to be ready. IPOC has added instruments 
to capture aftershocks from the 1 April quake 
and whatever might happen next. Dozens of 
new seismometers and global-positioning sta- 
tions have been deployed by teams from Chile, 
Germany and France, says Sergio Barrientos, 
director of the National Seismological Centre 
at the University of Chile in Santiago. 

For Sobiesiak, every little bit of data helps. 
Before the 1 April quake, she thought that 
the northern Chile seismic gap would rup- 
ture either to the north or to the south of the 
Pisagua area, but not right through it. Now 
she has some fresh thinking to do. “Each of 
these efforts really does bring us a step 
forward,” she says. = 
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Scientists had long 
assumed that only a major 
quake would relieve a 
century of pent-up stress 
in northern Chile’s ‘seismic 
gap’. But the tremor that 
struck on 1 April was 
smaller than expected. 


SOURCE: R. PRITCHARD & R. ALLMENDINGER, CORNELL UNIV. 


Quantum communications 
leap out of the lab 


China begins work on super-secure network as ‘real-world’ trial successfully sends 


quantum keys and data. 


BY JANE QIU 


ybersecurity is a step closer to the 
( dream of sending data securely over 

long distances using quantum physics 
— spurred by two developments. 

This week, China will start installing the 
world’s longest quantum-communications 
network, which includes a 2,000-kilometre 
link between Beijing and Shanghai. And a 
study jointly announced this week by the com- 
panies Toshiba, BT and ADVA, with the UK 
National Physical Laboratory in Teddington, 
reports “encouraging” results from a network 
field trial, suggesting that quantum communi- 
cations could be feasible on existing fibre-optic 
infrastructure. 


Conventional data-encryption systems rely 
on the exchange ofa secret ‘key’ — in binary 0s 
and 1s — to encrypt and decrypt information. 
But the security of such a communication chan- 
nel can be undermined if a hacker ‘eavesdrops 
on this key during transmission. Quantum 
communications use a technology called quan- 
tum key distribution (QKD), which harnesses 
the subatomic properties of photons to “remove 
this weakest link of the current system’, says 
Grégoire Ribordy, co-founder and chief execu- 
tive of ID Quantique, a quantum-cryptography 
company in Geneva, Switzerland. 

The method allows a user to send a pulse of 
photons that are placed in specific quantum 
states that characterize the cryptographic 
key. If anyone tries to intercept the key, the 
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act of eavesdropping intrinsically alters its 
quantum state — alerting users to a security 
breach. Both the US$100-million Chinese 
initiative and the system tested in the latest 
study use QKD. 

The Chinese network “will not only provide 
the highest level of protection for government 
and financial data, but provide a test bed for 
quantum theories and new technologies’, says 
Jian-Wei Pan, a quantum physicist at the Uni- 
versity of Science and Technology of China in 
Hefei, who is leading the Chinese project. 

Pan hopes to test such ideas using the net- 
work, along with a quantum satellite that his 
team plans to launch next year (see Nature 492, 
22-25; 2012). Together, he says, the technologies 
could perform further tests of fundamental > 
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Engineer Andrew Sharpe works on Toshiba’s quantum key distribution system in Cambridge, UK. 


> quantum theories over large scales (around 
2,000 kilometres), such as quantum non- 
locality, in which changing the quantum state 
of one particle can influence the state of another 
even if they are far apart, says Pan. 

Sending single photons over long dis- 
tances is one of the greatest problems in QKD 
because they tend to get absorbed by optical 
fibres, making the keys tricky to detect on the 
receiver's end. 

This is “a big challenge for conventional 
detectors”, says Hoi-Kwong Lo, a quantum 
physicist at the University of Toronto in Can- 
ada. But technological breakthroughs in recent 
years have significantly reduced the noise level 
of detectors while increasing their efficiency in 
detecting photons from just 15% to 50%. 

Vast improvements have also been made 
in the rate at which detectors can ‘count’ pho- 
ton pulses — crucial in determining the rate 
at which quantum keys can be sent, and thus 
the speed of the network. Counting rates have 
been raised 1,000-fold, to about 2 gigahertz, 
says Lo. 

The breakthroughs are pushing the distance 
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over which quantum signals can be sent. Trials 
using ‘dark fibres’ — optical fibres laid down 
by telecommunications companies but lying 
unused — have sent quantum signals up to 
100 kilometres, says Don Hayford, a researcher 
at Battelle, a technology-development com- 
pany headquartered in Columbus, Ohio. 

To go farther than that, quantum signals 
must be relayed at ‘node points’ — the quan- 
tum networks between Beijing and Shanghai, 
for instance, will require 32 nodes. To transmit 
photons over longer distances without the use 
of nodes would require a satellite. 

China is not alone in its quantum-com- 
munication efforts. A team led by Hayford, 
together with ID Quantique, has started 
installing a 650-kilometre link between Bat- 
telle’s headquarters and its offices in Wash- 
ington DC. The partnership is also planning 
a network linking major US cities, which 
could exceed 10,000 kilometres, says Hayford, 
although it has yet to secure funding for that. 

The Chinese and US networks will both use 
dark fibres to send quantum keys. But these 
fibres “are not always available and can be 
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prohibitively expensive’, says Andrew Shields, 
a quantum physicist at Toshiba Research 
Europe in Cambridge, UK. One way to side- 
step the problem is to piggyback the photon 
streams onto the ‘lit’ fibres that transmit con- 
ventional telecommunications data. However, 
those conventional data streams are usually 
about a million times stronger than quan- 
tum streams, so the quantum data tend to be 
drowned out. 

In the results announced this week, Shields 
and his colleagues were successful in achieving 
the stable and secure transmission of QKDs 
along a live lit fibre between two stations of 
the UK telecommunications company BT, 
26 kilometres apart. The quantum keys were 
sent over several weeks at a high rate alongside 
four channels of strong conventional data on 
the same fibre. 

The research builds on previous work in 
which Shields and his team developed a tech- 
nique to detect quantum signals sent alongside 
noisy data in a 90-kilometre fibre, but in con- 
trolled laboratory conditions (K. A. Patel et al. 
Phys. Rev. X 2, 041010; 2012). 

“Implementing QKD in the ‘real world’ is 
much more challenging than in the controlled 
environment of the lab, due to environmental 
fluctuations and greater loss in the fibre,’ says 
Shields. 

The quantum keys in the latest study were 
sent alongside conventional data travelling at 

40 gigabits per sec- 


The breakthroughs _ ond. “As faraslam 

are pushing the aware, this is the 

distance over highest bandwidth 

which quantum of data that has 

signals can be been multiplexed 

sent. with QKD to date,” 
add Shields. 


He calculates that it would be possible to 
send QKD signals alongside 40 conventional 
data channels. Optical fibres usually carry 
between 40 and 160 telecommunications 
channels, meaning that quantum communi- 
cation could be carried out with existing infra- 
structure. 

“T find it an impressive piece of work that 
demonstrates the multiplexing of strong clas- 
sical signals with quantum signals in the same 
fibre for the first time” in a field trial, says Lo. 
Removing the need for dark fibres, he says, is 
an important step in showing that QKD has the 
potential to be used in “real life”. m 


NATURE PODCAST 


The worth of the 

Y chromosome; an 
unusual amateur 
scientist; and how 
to help corals 


nature podcast 


TOSHIBA RESEARCH EUROPE 


breed nature.com/ 
nature/podcast 


go.nature.com/ 
[3mlxy 


@ Archaic human epigenomes get 
mapped go.nature.com/hnzsye 


ERIC AGOL (UW), NASA/SDO HMI 


442 | NATURE | VOL 508 | 24 APRIL 2014 
© 2014 Macmillan Publishers Limited. All rights reserved 


EQUINOX GRAPHICS/SPL 


BIOTECHNOLOGY 


IN FOCUS | NEWS 


RNA interference rebooted 


Gene-silencing technique yields promising treatments for liver -linked disorders. 


BY ERIKA CHECK HAYDEN 


fter a rocky start, RNA interference 
Aw a gene-silencing technique 

that won the 2006 Nobel Prize in Phys- 
iology or Medicine, is gaining momentum. 

In November, Alnylam Pharmaceuticals 
in Cambridge, Massachusetts, announced its 
first major success: evidence that an RNAi- 
based treatment drastically reduced levels 
of a toxic liver protein in people with a rare 
neurodegenerative disease. That opened up 
the floodgates: biotechnology firms have 
already raised nearly US$1 billion this year for 
RNAi-based drug development, and roughly 
150 therapies are being tested in patients. “It’s 
no longer a question of if RNA therapeutics 
will become a reality” but when, says John 
Maraganore, Alnylam’s chief executive. 

Dirk Haussecker, a biotechnology analyst 
in Baden-Baden, Germany, says that Alnylam 
and other companies pursuing RNAi thera- 
pies have overcome two significant challenges. 
First, they have solved the problem of delivering 
therapeutic snippets of RNA to the right cells at 
the right time. Second, they have narrowed their 
focus largely to diseases of the liver, exploiting 
its cells’ natural function: absorbing large mol- 
ecules from the blood. “For a long time, people 
doubted that this technology could yield a drug 
that is commercially viable, but that doubt has 
been removed over the past two years by clini- 
cal results,” says Haussecker, author of the blog 
RNAi Therapeutics (see go.nature.com/gtekcc). 

Still, sceptics argue that the technology is 
not ready for prime time: with only limited 
results from clinical trials of RNAi therapies, 
it is not yet certain that the approach is free of 
significant side effects. Naysayers also note that 
several major pharmaceutical companies have 
abandoned RNAi research since 2010. Most 
recently, Novartis in Basel, Switzerland, said 
on 14 April that it is substantially scaling back 
its RNAi research programme, citing “ongoing 
challenges with formulation and delivery” of 
RNAi therapies and the prospect that they may 
work for only a narrow range of conditions. 

RNAi was discovered in 1998 by two US 
geneticists: Andrew Fire, now at Stanford 
University, California, and Craig Mello of the 
University of Massachusetts Cancer Center in 
Worcester. The technique involves infusing a 
cell with small pieces of custom-made RNA. 
These bind to and disrupt the native RNA that 
translates DNA into proteins. Scientists have 
long hoped that RNAi could block mutant genes 
that encode faulty proteins that cause disease. 


UPS AND DOWNS 


Large drug companies have abandoned 
studies of RNA interference (RNAi), but 
smaller biotech firms are pushing ahead. 


ea aA US geneticists 


Andrew Fire and Craig Mello win Nobel 
prize for RNAi (RNA molecule pictured). 


Ue aA Vierck pays 


US$1.1 billion for California-based RNAi 
pioneer Sirna Therapeutics. 


NTH Att OPKO Health of Miami, 
Florida, terminates phase III trial of RNAi 
therapy for macular degeneration. 


PT awa Swiss firm Roche 
ends RNAi research after 3 years and 
$500 million. 


SOE aR Alnylam reports 


successful phase II trial of RNAi-based 
drug for liver-linked disease. 


SU A ra Alnylam buys Sirna 


from Merck for $175 million. 


APRIL 2014 
RNAi research. 


Novartis shutters most 


But getting delicate, small RNAs into cells 
has proved difficult. And the first major RNAi 
clinical trial — which aimed to treat macu- 
lar degeneration — was halted in 2009 after 
reports that the therapy might actually cause 
blindness. The small RNAs used in the trial, 
delivered with little modification, triggered cell 
death through an immune pathway that can be 
activated by any RNA more than 21 bases long. 

RNAi-based therapies fell out of favour after 
the failed trial, but smaller firms kept working 
on delivery systems. One such system, devel- 
oped by Tekmira in Burnaby, Canada, is based 
on lipid nanoparticles. Patisiran, the drug 
made by Alnylam to treat a rare disease known 
as transthyretin-mediated amyloidosis, is an 
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intravenous infusion of small RNAs packaged 
in these nanoparticles, which resemble a sus- 
pension of very small oil droplets. The drug is 
intended to reduce the production ofa mutant 
liver protein that circulates in the blood and 
accumulates in other tissues, causing nerve 
damage and, in many cases, death. 

In a 29-patient trial, Alnylam showed that 
patisiran could reduce blood levels of the pro- 
tein, transthyretin, by as much as 96%. The 
company has begun an expanded trial that 
it plans to conclude in 2017. It is also testing 
other RNAi-based drugs using its own delivery 
vehicle, a sugar molecule that is attached to a 
small RNA and targets a receptor on liver cells. 

Another firm, Arrowhead Research of 
Pasadena, California, has devised a third major 
delivery system — a customizable polymer 
that deposits RNAs in target cells. It is now 
being tested in clinical trials. “We've spent a 
lot of time in the wilderness trying to solve the 
delivery problem,” says Christopher Anzalone, 
Arrowhead’s chief executive. 

Victor Kotelianski, a former Alnylam 
executive who directs the RNA centre at the 
Skolkovo Institute of Science and Technol- 
ogy in Russia, says that caution is warranted. 
Larger trials of patisiran might uncover side 
effects that have not yet surfaced, if the RNAi 
approach is not as precise at hitting its targets 
as early results suggest. The company has also 
not yet proved that removing faulty transthyre- 
tin proteins from the blood actually resolves 
patients’ symptoms, he says. 

There are still enough lingering questions 
about RNAi’s usefulness to give large firms 
pause. “With a technology like RNAi, you dont 
necessarily know what its applications in the 
clinic will be, and those applications might not 
be compatible or consistent with the goals of the 
big pharmaceutical companies,’ Haussecker 
says. Before Novartis’s decision to abandon 
most of its RNAi drug development, Merck of 
New Jersey disclosed on 12 January that it had 
sold its small-RNA drug-discovery programme 
to Alnylam for $175 million — much less than 
the $1.1 billion that Merck paid to kick-start its 
RNAiresearch in 2006 (see ‘Ups and downs’). 

Yet Maraganore remains bullish, drawing a 
parallel between RNAi therapies and mono- 
clonal antibodies — large, complex proteins 
that bind to specific cell receptors, which were 
brought to market by focused biotech compa- 
nies, not big drug firms. “Large companies dont 
do a good job with technology,’ Maraganore 
says. “It’s the same story as with antibodies — 
were seeing that movie play out all over again.” m 


24 APRIL 2014 | VOL 508 | NATURE | 443 


Biologists are directing the 
evolution of corals to prepare 
them to fight climate change. 


BY AMANDA MASCARELLI 


ff the coast of American Samoa, the tropical sun beats 

down on a shallow tidal lagoon, heating the water to a 

sizzling 35°C for a few hours each day. Such tempera- 

tures would kill off most coral reefs, and yet the Samoan 

lagoon hosts courtyards of antler-like branching corals 

and mound corals the size of refrigerators. “The fact that 
they're there means they've adapted to survive,’ says Steve Palumbi, a 
marine biologist at Stanford University in California. “The real question 
is: how did they do that and can all corals do that?” 

Palumbi is just starting to understand how these Samoan corals thrive 
in such extreme conditions. And he thinks he might be able to harness that 
ability to create a reef of hardy coral with a chance of surviving the hot seas 
that are expected to result from climate change. Starting in August, he and 
his team are going to try to plant “the smartest future reef we can imagine”. 

Palumbi is part of a small group of coral 
researchers around the world tackling such _ Reefs thrive in the hot 
issues to throw threatened reefs a lifeline. Their waters of American 
ultimate intent is to launch a programme of Samoa that would kill 
‘human-assisted evolution, creating resistant _ other corals. 
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corals in controlled nurseries and planting them in areas that have been 
— or will be — hard-hit by changing conditions. “Tt’s a brave new world 
of working with corals in this way,’ says Ruth Gates, a marine biologist 
at the University of Hawaii at Manoa who, along with coral geneticist 
Madeleine van Oppen at the Australian Institute of Marine Science in 
Townsville, is helping to pioneer the field. 

The work is not without controversy. Although no one is yet 
attempting to create genetically modified corals, some researchers are 
concerned that human-assisted evolution goes too far down the slip- 
pery slope of altering natural systems. “If you're basically farming a 
reef, you've taken a natural habitat and you've converted it;’ says Steve 
Vollmer, a coral geneticist at Northeastern University’s Marine Science 
Center in Nahant, Massachusetts, who feels that more needs to be 
known before embarking on such programmes. “It’s like going to the 
Midwest and taking grasslands and making it into soy. There are huge 
implications to doing this.” 


IN HOT WATER 

Coral reefs have been besieged in recent decades by everything from 
warming waters to ocean acidification, disease, overfishing and pollu- 
tion. According to Status of Coral Reefs of the World: 2008, a synthesis 
report’ by hundreds of scientists and environmental managers, 19% of 
the world’s coral reefs have been lost since 1950 and another 35% are 
threatened or in critical condition. Some areas have suffered dispropor- 
tionately: the Caribbean, for example, has lost 80% of its reefs since the 
1970s (ref. 2). By the end of this century, researchers expect ocean waters 
to drop froma pH of 8.1 to 7.9 or lower, and to warm by at least 2°C, aver- 
aged across the globe. “It’s kind of like if you pull the plug on the bathtub 
and the water is rushing out — that’s the state of corals,” says Palumbi. 

Reef-restoration projects have been focusing on the Caribbean and 
other hard-hit spots for more than 20 years. In these programmes, small 
samples are taken from local reefs and grown in controlled coral nurs- 
eries. After a few months, fragments the size of a hand or larger can be 
‘outplanted’ to a reef using underwater cement, where the coral will 
continue to grow. Such projects have shown that transplantation and 
reef restoration can be done on a small scale. But transplanted corals 
grow more slowly and have higher mortality rates than normal’. “Coral 
restoration has always been highly expensive and slow and inefficient,” 
says Palumbi. “Figuring out how to do this in a smarter way is our goal.” 

That smarter way takes advantage of the surprising resilience and 
resourcefulness of some corals and the symbiotic algae that live inside 
them. “Sometimes we find reefs that are doing very, very well in places 
that you would least expect to find them,” says Gates — such as a reef off 
Taiwan that lies below the waste-water outfall pipe of a nuclear power 
plant and experiences temperature fluctuations of between 6°C and 8°C 
per day. “By all of our understanding, we would expect those corals to all 
be dead. But they’re not, they're flourishing.” 

Waters with a reduced pH are expected to dissolve coral skeletons — 
but in Palau in the western Pacific Ocean, researchers have found’ reefs 
that are bigger and more diverse in relatively acidic waters than the Pacific 
average. Another study” found that dire predictions about the frequency 
of future coral-bleaching events — mass die-offs when stressed corals 
lose their symbiotic algae — are reduced by 20-80% if the models take 
into account corals’ ability to adapt after previous bleaching events. That 
delays predicted mass reef deaths by about a decade. 

So far, researchers have only a handful of hints as to what makes some 
corals resilient. Ina study® published in 2013, Palumbi and his colleagues, 
including Daniel Barshis, a marine biologist at Stanford, compared two 
populations of the reef-building coral Acropora hyacinthus at their field 
site off Ofu Island in American Samoa. One population lives in the toasty 
pool where temperatures reach 35°C during sum- 


mer low tides and fluctuate by up to 6°C daily; the NATURE.COM 
other, less isolated by tides, has to deal with tem- To hear more about 
peratures of only about 29°C. The team placed _ howresearchers are 
samples in controlled tanks and shocked them __ helping corals, visit: 
with temperatures of nearly 3°C above normal for __go.nature.com/ov46wo 
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four days. All of the corals bleached by the end of the fourth day. But 
those from the hotter pool survived for longer and had higher expression 
of 60 genes, including well-known thermal-tolerance genes such as those 
that make heat-shock proteins and antioxidant enzymes. 

Palumbi and Barshis think that genetic fitness and acclimatization 
both play important roles in boosting tolerance. Their analyses suggest 
that corals can ‘toughen up over the course of their lifetimes in response 
to environmental conditions. Those in the hot pool are physiologically 
primed to tolerate additional heat stress, “like an athlete who's been 
training every day since a very early age’, says Barshis. 


“WE WOULD EXPECT THESE 
CORALS TO ALL BE DEAD. 
BUT THEY'RE NOT, THEY'RE 
FLOURISHING.’ 


A promising twist is that the more heat-tolerant species seem also 
to be more transplant-friendly. After experimentally planting some 
400 samples from the two reef areas back into the two pools, Palumbi 
and his team found that the corals from the hotter pools transplanted 
more efficiently and grew faster than those from the cooler pools. 

This August, Palumbi and his colleagues plan to begin an experimental 
restoration project on Sili Reef off Ofu Island. To select the best corals, the 
researchers will rely on their extensive data for the area, including growth 
measurements and transcriptomes — blueprints of the part of the genome 
that is actively transcribed into proteins. They also plan to use data froma 
portable stress test for corals that Palumbi is developing — “like a human 
treadmill test for cardiac function’, he says. He and his team have built 
tanks out of 7.5-litre cooler boxes rigged with lights, heaters and chillers 
that can dose corals with a controlled bout of high physiological stress. 
By monitoring bleaching and chlorophyll content, they should be able 
to predict how corals might respond to potential bleaching conditions. 

Using all this information as a guide, they will handpick the hardiest, 
fastest-growing and most heat-resistant corals for their smart reef. At the 
same time, they will build a second reef from corals selected at random. 
They will then monitor reef survival over several years. “The question 
is: can we do better if we have a lot of information about the individual 
corals?” says Palumbi. “Honestly, I don’t know the answer.” 


LEGACY OF SURVIVAL 

Others have found encouraging evidence that stress resistance gained 
through acclimatization can be passed on to offspring. Unpublished 
work by Gates, led by the University of Hawaii's Hollie Putnam, shows 
that adult cauliflower corals (Pocillopora damicornis ) exposed to stress 
during brooding produce larvae with increased resilience to heat and 
ocean acidification. The team hypothesizes that this transgenerational 
protection is caused by epigenetic changes: the modification of molecu- 
lar tags on the genome that affect gene expression. 

Gates and van Oppen are aiming to look specifically at areas that have 
already survived massive bleaching events, such as Moorea in French 
Polynesia, the central Great Barrier Reef in Australia, and the Seychelles, 
where 97% of corals in the inner islands died following the 1997-98 
El Nifio oceanic warming event. (A nursery has already been created 
from the Seychelles corals that survived, and fragments grown from 
them have been planted onto reefs to aid their recovery.) Gates and 
van Oppen aim to cross-breed corals that have survived such stressful 
bleaching, and to track the resilience of the offspring. 

Their ideas won Gates and van Oppen the 2013 Paul G. Allen Ocean 
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Madeleine van Oppen collects corals for study and selective breeding. 


Challenge prize of US$10,000, along with an invitation to apply for 
multimillion-dollar funding. Depending on how much of that funding 
comes through, they also aim to use heat and acidity to stress corals 
before they breed, to see if and how tolerance gets passed down the 
generations. Beginning in May, van Oppen and her team will start col- 
lecting adults of the branching coral Pocillopora acuta from the Great 
Barrier Reef, and will grow them in the Australian Institute of Marine 
Science's massive National Sea Simulator, an aquarium facility that pro- 
vides controlled tanks to replicate open-ocean conditions. 

Ultimately, Gates and van Oppen hope to create a ‘seed bank of 
gametes and fertilized embryos from extreme settings in which corals 
persist despite the odds — including the shallow reefs skirting Coco- 
nut Island, Hawaii, where both temperature and pH fluctuate drasti- 
cally, reaching upper limits similar to those expected in the open ocean 
by 2050. The seed bank would add to efforts spearheaded by the US 
Smithsonian Institution, in collaboration with Hawaiian and Australian 
bodies, which are already banking coral sperm and embryonic cells. 

A final, important piece of the puzzle is the corals’ symbiotic algae: these 
are shorter-lived and faster-evolving than their hosts, and research has 
shown that they can pass along thermal tolerance. One study’, for exam- 
ple, found that juvenile corals inoculated with strains of algae collected 
from a warm reef known for heat resistance grew well when exposed to 
temperatures up to 32°C, whereas samples of the same coral inoculated 
with algae from a cooler reef suffered bleaching and tissue death. 

Andrew Baker, a marine biologist at the University of Miami in Flor- 
ida, and his colleagues discovered* that symbionts from a lineage called 
clade D tend to become more prevalent in some corals when they are 
heat stressed, suggesting that the algae are better able than other strains 
to survive such conditions, and that they help their hosts to survive too. 
Since then, studies have shown’ that clade D symbionts, in particular 
types D1 and Dla, are prevalent in a wide variety of corals that have 
survived extreme bleaching events. Putnam, Gates and their colleagues 
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have found” that a different strain, C15, seems to be dominant in heat- 
resistant corals near Moorea. 

Researchers such as Baker are starting to think about the possibility 
of intentionally seeding coral reefs with hardier strains of algae to help 
them to resist the perils of climate change. But it is still unclear whether 
it will be possible to manipulate symbiont populations effectively in the 
wild, where environmental conditions might cause the corals to favour 
one type of alga over another. 

Workers at existing coral nurseries and farms have been sending 
samples of coral and symbionts to researchers for genetic sequencing, 
while keeping tabs on which organisms fare well in heat shocks or disease 
outbreaks. Researchers have banked hundreds of genotyped strains from 
a handful of coral species, including the critically endangered staghorn 
coral (Acropora cervicornis) from locations in the Caribbean, says Les 
Kaufman, a marine biologist at Boston University in Massachusetts. 


HELP OR HARM 

The days of trying to build reefs with designer-made corals are still in 
the future. But as the research heads in that direction, some are wary 
that such tinkering might do more harm than good. 

Selecting for traits such as resistance to heat or acidification might lead 
to a genetic bottleneck, for example. “Selective-breeding programmes 
may effectively reduce the capacity of corals to adapt to future changes 
in environmental conditions by narrowing genetic variation,’ says David 
Miller, a coral biologist at James Cook University in Townsville. And that 
is if selective breeding in corals even works. It is too soon even to tell 
whether acid and heat resistance are strongly heritable, he says. 

Miller and others point out that cross-breeding to enhance specific 
traits in crops and dogs, for example, often comes at the expense of 
other traits. “There's often a ‘trade-off’ effect, so that, for example, more- 
stress-tolerant individuals are likely to grow more slowly,’ says Miller. 
Selecting for resilience against heat and acidity could hypothetically lead 
to higher susceptibility to disease, for instance. 

Manuel Aranda, an evolutionary molecular biologist at the Red Sea 
Research Center at King Abdullah University of Science and Technol- 
ogy in Thuwal, Saudi Arabia, agrees that breeding might come at a cost. 
But he says that the serious decline in reefhealth warrants exploring all 
available options. “If you think about losing an entire ecosystem, you 
want to start somewhere.” 

Until recently, says Baker, the goal for coral-reef management was 
simply to create marine reserves and reduce the pressures of pollution 
and fishing, hoping that that would leave reefs strong enough to deal 
with climate change. “The pendulum has sort of shifted as people have 
realized just how dire the situation is,’ says Baker. “We need to do more 
than that. We need to take action” 

Some 500 million people depend in some way on coral reefs for 
food and income, and the livelihoods of another 30 million are entirely 
dependent on reefs’. For Gates, statistics like those, combined with the 
facts of climate change, make the pursuit of assisted evolution necessary 
and urgent. “We don't have a lot of time,” she says. = 


Amanda Mascarelli is a freelance reporter based in Denver, Colorado. 
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BIOFUELS 


Anew generation of industrial plants can make liquid fuels from almost 
any organic scraps — from corn stalks and wood chips to urban rubbish. 


BY KIM KRIEGER 


y the end of 2015, all British Airways flights out of London 
i City Airport will be fuelled by rubbish — the paper, food 
(B scraps, garden clippings and other organic detritus dis- 
carded by the city’s residents. 

But before it goes into planes, the rubbish will be pro- 
cessed at GreenSky London: a biofuels plant under construction on the 
eastern side of the city. Each year, the facility will take in some 500,000 
tonnes of the city’s waste and will transform the organic component into 
60,000 tonnes of jet fuel, a similar quantity of diesel fuel combined with 
petrol-like naphtha, and 40 megawatts of power. 

This level of output would hardly be noticed at conventional petroleum 
refineries, which typically generate a similar volume of product within a 
week. But “gathering enough biomass to run a petroleum-scale refinery 
is almost unthinkable’, says Nathanael Greene, director of renewable- 
energy policy at the Natural Resources Defense Council in New York 
City. GreenSky London is typical of a trend for second-generation biofuel 
reactors that are not only omnivorous — they can be fed with corn stalks, 
wood chips and other forms of agricultural waste, as well as urban rubbish 
— but also small. The hope is that they will slash transportation costs by 
bringing the reactors to the biomass instead of vice versa. 

Proponents argue that novel catalytic techniques and compact designs 
will make second-generation biofuel plants not just environmentally 
friendly, but also profitable enough to compete with petroleum-based 
fuels without subsidies. Questions remain about how realistic this hope 
is. But at least some customers are giving the plants a try; commercial 
units have begun to spring up from Finland to Mississippi to Alaska. 

If these second-generation plants do succeed, says Greene, they will 
offer at least one crucial advantage over their predecessors: a low-carbon 
way to create fuel that suits existing vehicles. 

Limited compatibility in this regard is a key problem dogging the first 
generation of biofuels plants, which rely on technology developed over 
millennia to make beer, wine and spirits. These facilities grind up edible 
products such as corn or sugarcane, add water and yeast, and allow fer- 
mentation to take its natural course. The result is a copious supply of ethyl 
alcohol, which makes an excellent fuel and can be mixed with petrol. 

But there are serious drawbacks to making fuel from food in a world 
with a growing population and limited arable land. So, for more than a 
decade, the biofuels industry has been working on economical ways to 
use cornstalks, wood chips and other by-products that currently go to 
waste. This has posed a challenge for the fermentation approach, because 
these materials contain tough, long-chain molecules such as cellulose 
and lignin that yeast cannot easily digest. Over the past five or ten years, 
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advances involving pretreatment with acids and enzymes have partially 
overcome that barrier, and commercial plants designed to produce cellu- 
losic ethanol are now under construction in Iowa and Kansas (see Nature 
507, 152-153; 2014). 


HITTING THE WALL 

But even these facilities will not be able to overcome the biggest restric- 
tion on the fermentation approach: the ‘blend wall. This is the maximum 
amount of ethanol that can be mixed into petrol without causing corro- 
sion in fuel lines and car engines. For current models, the blend wall is 
about 10-15% — and first-generation fermentation plants already pro- 
duce more than enough ethanol to meet this demand. Indeed, several 
US ethanol refineries built in the past decade already stand idle, victims 
of drought-induced price hikes and market saturation. 

That reality, paired with nine years of historically high oil prices — 
the price per barrel currently stands at around US$100 — has spurred 
vigorous research into thermochemical reactors, which convert biomass 
directly into fuels other than ethanol using heat and catalysts. 

The most common thermochemical approach is gasification, in 
which carbon-rich material such as coal, wood chips or municipal waste 
is heated to produce synthesis gas or ‘syngas’ — a mixture of mainly 
hydrogen and carbon monoxide, with traces of carbon dioxide and 
other gases (see ‘Fuel flow’). At the GreenSky London facility, one or 
more proprietary gasifier units built by Solena Fuels, a renewable energy 
firm in Washington DC, will accomplish this step by vaporizing the 
waste with jets of ionized plasma that heat the material to some 3,500 °C. 
Such torches are more energy intensive than other methods of gasifica- 
tion, in which the biomass is heated from below on beds of sand or other 
material. GreenSky London elected to use them because the contents of 
municipal waste can vary markedly, and by adjusting the temperature 
of the torches, the composition of the syngas can be kept consistent. 

Consistency is important for optimizing the second step of the process, 
in which the syngas is sent into a chemical reactor — made, in the case 
of GreenSky, by Velocys of Plain City, Ohio. There, it undergoes the Fis- 
cher-Tropsch reaction, which fuses hydrogen and carbon monoxide into 
long-chain hydrocarbons. Velocys has made its system unusually compact 

by reducing the size of the cobalt-containing cata- 


> NATURE.COM lyst particles to the nanometre scale and arranging 
For more on small, them along a series of microchannels, which direct 
local sources of the flow of the syngas and produce reaction sur- 
power, see: faces with an effective area of a much larger device. 


go.nature.com/nubllr The Fischer-Tropsch units have also been made 
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FUEL FLOW 


The fossil-fuels industry has long used 
gasification and pyrolysis to produce 
liquid fuels. The same techniques are 
now turning biomass into fuels that 
can be refined into diesel and petrol. 


GASIFIER 
Heats biomass 
to 900°C or more 


— 


syngas 


FEATURE 


FISCHER-TROPSCH REACTOR 


Turns hydrogen and carbon monoxide 


into liquid hydrocarbons 


ae 


REFINERY 


PYROLYTIC REACTOR 
Heats biomass to about 500°C 
in absence of oxygen 


as modular as possible, so that chunks can be assembled at a factory and 
then be plugged together on site. “You need to achieve economy not 
through size, but in the way you build them,” says Neville Hargreaves, 
business-development manager at Velocys. 

Another compact system is the BioMax gasifier developed by the 
Community Power Corporation in Englewood, Colorado. The com- 
pany says that this device is modular and small enough that four can 
fit into a standard shipping container, and can run on almost any kind 
of shredded biomass, from food scraps to cardboard to wood chips. 
The resulting syngas can then be used in place of natural gas for heat- 
ing, cooling or electricity generation. A typical unit generates about 
150 kilowatts, enough to power between 25 and 50 homes, run three 
supermarkets or keep vital hospital equipment working. And in the 
near future, BioMax units should be able to plug in a Fischer-Tropsch 
reactor and produce biodiesel as well. 

In 2011, Community Power was bought by the Afognak Native Cor- 
poration, which is owned by the indigenous people of Alaska’s Afognak 
Island. They hope to sell the units throughout Alaska and northern 
Canada, where electricity and transportation fuel are expensive. 


CLEAN COMBUSTION 
Among the strongest selling points of the two-step gasification approach 
to biofuels is the fact that almost all the syngas gets turned into hydro- 
carbons with no double bonds or ring structures, producing fuels that 
burn cleanly and completely. But that advantage has not kept researchers 
from exploring a single-step alternative. In the pyrolysis approach, the 
biomass is heated in the absence of oxygen to some 500°C and converted 
into organic liquids directly. These liquids can then be refined into fuels 
using standard technology. Pyrolysis is relatively immature compared 
with gasification, says Mark Nimlos, a principal scientist at the National 
Renewable Energy Laboratory in Boulder, Colorado. But that can be 
seen as a virtue, he adds. “There is lots of potential to improve” 
Several companies are already testing the commercial viability of the 
technology. For example, UOP of Des Plaines, Illinois — a subsidiary 
of the New Jersey-based conglomerate Honeywell International — is 
partnering with Ensyn Technologies of Ottawa to market Ensyn’s Rapid 
Thermal Processing (RTP) units. The companies foresee these units 
being installed next to lumber mills, where each one would be capable 
of turning waste wood into some 76 million litres of pyrolysis oil a year. 
That would be enough to warm 31,000 homes if it were burned directly 
as heating oil; alternatively, if refined into petrol, it could fuel about 
35,000 typical US automobiles. 


Green Fuel Nordic, a biorefining company based in Kuopio, Finland, 
is planning to install at least one RTP unit in the Finnish town of Iisalmi, 
where it will process waste from the country’s extensive forestry indus- 
try. The company is also working with the European Commission to 
develop a set of quality standards for pyrolysis fuels. One component 
of concern is tar: a gummy residue of long-chain molecules that are 
hard to refine. Another is oxygen, which is abundant in biomass and 
reacts with pyrolysis oil to form organic acids that can seriously cor- 
rode refinery equipment. Finding better ways to deal with both of these 
contaminants is a major goal of pyrolysis-oil research. At present, the 
easiest way to remove the oxygen is to add molecular hydrogen derived 
from natural gas, but that would both undermine the climate-friendly 
appeal of pyrolysis oil and drive up the cost. 

Indeed, the financial viability of any of the second-generation biofuel 
technologies is still an open question. Witness the saga of one of the 
world’s most advanced pyrolysis biorefineries: a $225-million facility in 
Columbus, Mississippi. Owned by KiOR, a renewables company based in 
Pasadena, Texas, the facility demonstrated its technical viability by pro- 
ducing some 3.5 million litres of petrol and diesel fuel from wood waste 
in 2013 — about as much as a conventional petroleum refinery produces 
ina day. But KiOR, which shut down the Columbus refinery for upgrades 
in January, will run out of operating funds by the end of August — and it 
has enough money to last that long only because it secured a $25-million 
loan earlier this month from billionaire venture-capitalist Vinod Khosla, 
whose company, Khosla Ventures, originally funded the project. 

For GreenSky London, economic viability also remains an open ques- 
tion. But its partners — Velocys, Solena and British Airways — are 
hopeful. They have not disclosed the cost of the facility, but none of 
them see cost as the central issue. British Airways expects the operation 
to help it meet the carbon-emissions targets mandated by the European 
Union while ensuring a steady supply of jet fuel that is not subject to 
the price fluctuations that plague the oil market. And both Solena and 
Velocys hope that GreenSky London will be the first of many such facili- 
ties serving airports around the world. 

Every field, forest and landfill site is a potential fuel source for these 
facilities, says Hargreaves. And the need for liquid fuels will never com- 
pletely go away. “Fifty years in the future, we might get land transporta- 
tion entirely electrified,” he says. But aircraft require a level of energy 
density that batteries simply cannot provide. Liquid fuel, he says, 
“is very difficult to substitute.” m 


Kim Krieger is a freelance writer in Norwalk, Connecticut. 
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Gather and use genetic 
data in health care 


Research into how genetic variants can guide successful treatments must 
become part of routine medical practice and records, says Geoffrey Ginsburg. 


ore and more people are getting 
Me DNA sequenced. But the 
use of genetic data to inform 
medical decisions is lagging. More than a 
decade since the Human Genome Project 
was declared complete, fewer than 60 genetic 
variants are deemed worthy for use in clini- 
cal care, most for severe conditions in very 
young children’. 
These genetic variants can guide medical 
decisions (see ‘Genes that doctors use’). By 
some estimates, women with certain variants 


in the BRCA genes have about an 80% chance 
of developing breast cancer, leading some 
who carry the mutation to opt for preven- 
tive mastectomies. Screening for faulty genes 
involved in iron transport can alert affected 
individuals to a need to alter their diets to 
avoid developing haemochromatosis, a toxic 
build-up of iron that damages the liver, heart 
and other organs. Mutations in the EGFR 
gene can indicate whether lung cancer will 
respond to expensive drugs with fewer side 
effects than standard chemotherapy. But five 
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years after EGFR tests were commercialized, 
only around 6% of appropriate US patients 
were being genotyped, partly because their 
physicians were unaware of the tests’. 
Clinical trials have been used to assess 
whether genomic information yields practi- 
cal benefits. A study” of nearly 2,000 patients 
with HIV showed that genetic screening for 
a variant called HLA-B*5701 could help to 
prevent toxic reactions to the AIDS drug 
abacavir — a fact that is now written into US 
treatment guidelines. And last year, two > 
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> separate randomized trials** evaluated 
whether genotyping should guide the use of 
warfarin, prescribed to prevent blood clots. 
The studies reached different conclusions, 
perhaps demonstrating that such trials are 
not always the best way to assess the use of 
a genetic test. 

What is more, such trials can take years 
and millions of dollars to complete. They are 
usually paid for by pharmaceutical compa- 
nies in the hope of selling a specific product. 
But only a fraction of genetic variants are 
tied to commercial opportunities. 

Meanwhile, numbers of intriguing genetic 
variants are surging. Analysis of tumour 
genomes has revealed some 140 genes whose 
mutations contribute to cancer®. Saudi Ara- 
bia, the United Kingdom and the United 
States have all launched projects that in 
total will sequence about 100,000 individu- 
als. The clinical-sequencing market has been 
estimated at more than US$2 billion. 

For these investments to pay off, there 
needs to be an efficient way to evaluate 
whether genetic infor- 


mation leads to better “The clinical- 
health care.Onesolu- sequencing 
tion is to gather evi- market 

dence during routine hasbeen 
clinical care.In other eeftiynated at 
words, health sys- yore than 


tems shouldembrace [j§ ¢ hillion.” 
research. This will 
require hefty investment in new workflows, 
informatics and data-analysis techniques. 
With the right infrastructure, the number 
of clinically useful variants could grow into 
the thousands, making genomics crucial to 
medical care’. In the United States — my 
focus in this article — a large, fragmented 
health sector fosters many independent arms 
that can struggle to work together, in contrast 
to a national health-care delivery system such 
as the National Health Service in the United 
Kingdom. But the need for health-care sys- 
tems that learn is similar the world over. 


COMBINE CARE AND RESEARCH 

Some health-care organizations are working 
out ways to collect evidence from ongo- 
ing care to improve patient outcomes. Such 
efforts systematically analyse clinical data 
from peer-reviewed literature and introduce 
selected findings into clinical care. Then they 
track and evaluate outcomes. 

One of the best examples of such a ‘learn- 
ing health-care system’ is the Geisinger 
Health System, which serves 2.6 million 
patients in Pennsylvania. Once diagnosed 
with chronic conditions such as diabetes, 
heart failure or depression, patients’ care 
and outcomes are monitored. Metrics such 
as hospitalization rates are used to recom- 
mend that certain individuals receive coor- 
dinated attention from a team of specialists 
including mental-health practitioners and 
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GENES THAT DOCTORS USE 

Genetic variants can determine whether a treatment harms or heals. 

Condition Gene or variant 

Cystic fibrosis CFTR-G551D (plus eight other 
variants) 

HIV HLA-B*5701 

Melanoma BRAF V600E/K and similar 
variants 

Lung cancer EGFR 

Family history of BRCA1/BRCA2 

breast cancer 


nutritionists. Over three years, this reduced 
hospital admissions by 18% and seemed to 
reduce spending by 7% (ref. 8). 

A learning health-care system depends 
on electronic medical records (EMRs) — 
comprehensive, accessible stores of patient 
data. EMRs can, for instance, allow a health 
system to identify people with diabetes who 
have also been prescribed blood-pressure 
medication, and determine rates of heart or 
liver failure according to age or gender. 

Dozens of health-care systems are adapting 
to conduct genomics research’. Several hos- 
pitals, including Children’s Mercy Hospital 
in Kansas City, Missouri; Baylor College of 
Medicine in Houston, Texas; and the Medi- 
cal College of Wisconsin in Milwaukee, are 
using sequencing to unravel maladies that 
defy diagnosis, including life-threatening 
intestinal inflammation in a young child and 
the probable cause of seizures in another. 

The UK Genomics Initiative is sequenc- 
ing 100,000 people’s genomes and link- 
ing the resulting data to the participants’ 
National Health Service records. In the 
United States, the care consortium Kaiser 
Permanente and the University of Califor- 
nia, San Francisco, are testing whether a 
set of genetic variants assessed in 100,000 
patients might identify those with higher 
health risks. Vanderbilt University in 
Nashville, Tennessee, found” that 91% of 
all patients and 96% of African American 
patients carried one or more variants impli- 
cated in affecting response for five common 
prescription drugs, including warfarin and 
simvastatin, a cholesterol-lowering agent. 

Such ‘pharmacogenomic’ applications 
— in which genetic markers are used to fit 
drugs to patients — are among the most 
promising areas for collecting evidence 
during clinical care. In Thailand, about 
12% of people have genetic predisposition 
to Stevens—Johnson syndrome, in which 
certain medications trigger a blistering, 
life-threatening rash. The government has 
sponsored a programme in which any Thai 
citizen can be genotyped to predict reactions 
to problematic drugs such as carbamaz- 
epine, commonly used to control seizures. 
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Action 
Select medication only effective in patients carrying 
the variants. 


Avoid HIV drug abacavir. Select alternate 
medication that is less toxic to patient. 


Select medication that is only effective in patients 
carrying the variants. 


Select medication that is only effective in patients 
carrying key mutations. 


Opt for close screening or pre-emptive mastectomy. 


Ramathibodi Hospital in Bangkok provides 
ahealth card for patients with risky genetic 
variants to present to pharmacists, alerting 
them to provide alternative medications. 
Whether decreased toxicity merits the use 
of less-effective drugs is being evaluated. 


INVESTIGATE AND INFORM 

Other efforts are making genomic informa- 
tion part of routine care during a visit to the 
doctor or a hospital stay. The US National 
Human Genome Research Institute is now 
funding projects that explicitly explore clini- 
cal implementation of genomic information 
through EMRs. The Electronic Medical 
Records and Genomics (eMERGE) net- 
work, a consortium of nine institutions that 
use commercial and homegrown EMRs, is 
developing ways to capture genomic infor- 
mation, store it securely and incorporate it 
into computer algorithms to guide clinicians. 

As more genetic data are incorporated 
into EMRs, health systems can carry out 
‘pragmatic clinical trials’: — those that rand- 
omize groups of clinicians to study an inter- 
vention (in this case treatments guided by 
genetic information) compared with usual 
care. Although standards for how results 
should be assessed and adopted still need to 
be worked out, such studies are cheaper than 
conventional clinical trials and can collect 
data from people with multiple conditions, 
who are often excluded from trials. 

To translate genetic analysis into improved 
medical care, hospital executives and other 
leaders must redesign how they collect 
and manage data so that it can be used in 
research. Existing patient databases for clini- 
cal care are often inadequate for evaluating 
health interventions: data are frequently 
missing or incorrect, and it can be hard to 
link data about, for example, demographics, 
appointments, procedures, medications and 
Vital signs. 

Pioneers of learning health-care systems 
have established streamlined consenting pro- 
cesses and data warehouses that link EMRs 
with genomic assays from patients’ banked 
biospecimens. However, systems are not con- 
sistently incentivized to build these capacities. 


ADAPTED FROM I. J. KULLO ET AL. GENET. MED. 15, 270-271 (2013) 


Health leaders want evidence before investing 
resources, yet without those investments, 
such evidence is hard to gather. 

This makes the experiences from 
diverse health systems all the more 
precious. In January this year, a 
nascent organization of countries 


and international organizations piges: 
met to discuss their efforts to Eas 
evaluate how and whether he 
genetic variants can improve Se 
clinical care. Programmes Be 

are a blend of data collection Eauis 
(bringing genomic information e 


and medical records together), 
clinical implementation (get- 
ting providers to use informa- 
tion) and outcomes (assessing the 
impact of using information). 

For example, the Singapore 
National Eye Center is using genetic 
testing to diagnose some eye diseases, 
select treatments and, when warranted, 
encourage family members to be screened. 
The Luxembourg Centre for Systems Bio- 
medicine and the Centre Hospitalier de Lux- 
embourg are diagnosing and stratifying risk 
for Parkinson's disease, and the University of 
Colombo in Sri Lanka is doing the same for 
inherited blood disorders. Vanderbilt Univer- 
sity, the Geisinger Health System and North- 
western University in Evanston, Illinois, are 
incorporating pharmacogenetic testing to 
select medications. And Duke University in 
Durham, North Carolina, allows people to 
report their family histories of disease online. 
This flags the individuals most likely to ben- 
efit from genetic screens for variants that raise 
colon-cancer risk or lead to heart defects that 
can cause sudden death. 

Such projects must be sure to provide 
iterative feedback on how genetic informa- 
tion is used and how it affects outcomes. It 
is important to understand how and when 


physicians and their patients should receive 
genomic information, and how itis likely to 
alter their decisions. This will inform when 
patients should get genetic tests and how 
complicated information can be delivered 
to both patients and physicians. 

But how will the use of genetic variants 
in medical decisions move from exploratory 
to mainstream? The US Centers for Disease 
Prevention and Control has developed a 
useful framework to evaluate new genom- 
ics tests. It includes whether a test measures 
genotype accurately, how reliably it provides 
information about a diagnosis or outcome, 
how likely it is to improve outcomes and 
whether the test has ethical, legal or social 
implications, such as uncovering false pater- 
nity or future untreatable conditions. 


TO-DO LIST 


Next steps for key players in medical genomics 


© Health-care providers. Learn to use 
electronic medical record (EMR) enabled 
tools and to apply genomic information to 
clinical decisions. 

@ Patients. Define preferences about what 
information they wish to learn and have 
placed in their EMR. 

© Clinical molecular geneticists. Interpret 
how genetic results change biological 
function, including variants that warrant 
clinical action. 

© Bioinformaticians and computing 
specialists. Develop curated genomic 
databases and means to query them to 
guide clinical decisions. 


© Genetic counsellors. Provide information 
about the potential burden of results and 
implications for family members. 

@ Policy-makers. Define the rules to return 
results to patients and ensure privacy and 
confidentiality. 

@ Researchers. Develop consensus on how 
best to conduct research using EMRs linked 
to genomic information. 

© Translational specialists. Train people 
across disciplines to communicate 
effectively and provide the ‘glue’ for the 
system to work. 

© Software vendors. Design effective EMR 
and related systems. 
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The medical-genomics community 
now needs to focus on how to decide 
what levels of evidence are required to 

demonstrate that assessing a variant 
helps patients. For example, a vari- 
ant that may require a change in 
diet might need less evidence 
than one that may require an 
___ irreversible surgical proce- 
_ dure. The clinical community 
should commit to developing 
__-ways to evaluate genetic vari- 
| ants as ‘actionable, including 
-_- understanding how results 
from one patient population 
might apply to another with 
differing genetic backgrounds 

or health practices. 

Most health systems around 
the world are not fit to shepherd 

genome sciences into routine health 
care. Each health system still makes 
ad hoc decisions about how to use genetic 
information. Holding international meet- 
ings and establishing consortia are essen- 
tial first steps. Hospitals must appreciate 
that informatics investments will benefit 
patients and efficiency. Such shifts will be 
difficult — clinicians may resist, resources 
will be tight and experiments will fail 
(see ‘Next steps for key players in medi- 
cal genomics’). Adoption will increase as 
success stories accumulate and prac- 
tices change, but those who act early will 
enjoy some advantages. US systems that 
made initial investments in institutional 
biobanks were better placed to compete 
for federal funding, for example. 
Meanwhile research and health-care 
funders should incentivize those embed- 
ding genetic information into clinical 
care. Providers, regulators, insurers and 
health ministries will have more confi- 
dence in gene-tailored treatments, and 
patients will hopefully have better care and 
improved health. = 


Geoffrey Ginsburg is director of genomic 
medicine at Duke University in Durham, 
North Carolina, USA. 

e-mail: geoffrey.ginsburg@duke.edu 
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Beatrix Potter’s 1895 drawings of the Boletus granulatus mushroom were part of a study that she submitted to the UK Royal Botanic Gardens at Kew. 


NATURAL HISTORY 


A scientist’s eye 


Beatrix Potter’s meticulous artistry served mycology and entomology as well as 
children’s fiction, reveals Linda Lear. 


discovery of a rare parasitic fungus on 

the Mar Lodge Estate in Aberdeenshire. 
Liz Holden, an independent field mycolo- 
gist, spotted the small jelly fungus Tremella 
simplex growing on the pink blobs of another 
rarity, Aleurodiscus amorphus. When she 
checked, she discovered that T. simplex had 
first been drawn in the late 1890s, by Beatrix 
Potter (1866-1943). 

Before Potter became a famous children’s 
author and illustrator, she was a pioneering 
naturalist and amateur mycologist, although 
later discouraged by professionals in Britain’s 
natural-history establishment. It was her habit 
to draw everything she saw under the lens, 
so Potter included the Tremella in her study, 
although she could not have recognized it 


E January, the British press reported the 
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then as an independent parasitic fungus. 
Potter was an extraordinary observer whose 
many contributions to natural science are 
only now becoming more widely recognized. 
Along with women such as Margaret Gatty, 
author of The History of British Seaweeds 
(1863), Potter was part of a generation of 
female naturalists whose work contributed 
to the advancement of professional science, 
whether acknowledged or not. 

Potter always prized the tribute paid to 
her by family friend John Everett Millais, the 
Pre-Raphaelite soci- 


ety painter: “plenty of NATURE.COM 
people can draw, but — ForLindaLearon 
you ... have observa- _ Potter's scientific 
tion” All her life, she — legacy: 
exhibiteda meticulous —_ go.tiafure.com/zzibml 
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concern for factual evidence. Her recording 
of observable data, although deliberately 
never systematic because she followed her 
artistic inclinations, marked her as a student 
of natural history from a young age. At nine, 
she was executing watercolour sketches of 
caterpillars, complete with physical descrip- 
tions and field observations. That she was 
interested in geology, archaeology, entomol- 
ogy and especially mycology was not unusual 
for someone raised in wealth and privately 
educated. What was rare was how Potter 
used her gifts in diverse areas, from stories 
for children and animal husbandry to the 
preservation of land, farms and watersheds 
in the English Lake District. 

Potter’s childhood offered unique 
opportunities for observing and recording 


© THE ARMITT TRUST 


© FREDERICK WARNE & CO., 1966 


BEATRIX POTTER SOC. 


nature. She enjoyed summers exploring 
and drawing the flora and fauna of Perth- 
shire near the River Tay in central Scotland, 
tagging along with her artistic parents and 
absorbing photographic techniques of per- 
spective and detail from her father, a fine 
amateur photographer. As a young woman, 
she explored the Tay Valley in her pony and 
trap, noting in her journal geological forma- 
tions, diversity of land use and the progress 
of soil erosion, and despairing over practices 
such as the dehorning of Ayrshire cattle. 


SCHOOLROOM SCIENCE 
The boredom of the Victorian schoolroom 
enhanced Potter’s skills almost by default. 
After lessons, Beatrix and her younger 
brother Walter drew a menagerie of animals 
secretly conveyed into the nursery — rab- 
bits, mice, hedgehogs, bats, snails and lizards 
— as well as more typical collections of 
insects and bird eggs. When a schoolroom 
pet died, the Potter children often boiled the 
corpse and articulated the bones to improve 
the anatomical accuracy of their drawings. 
Potter noticed that lettuce contained a 
“soporific” that made her pet rabbit Benja- 
min sleepy; that field mice were inordinately 
fastidious housekeepers; and that hedgehogs 
yawned “pathetically” and might bite when 
propped up in one position to draw. Such 
discoveries later informed the plots and 
characters of her children’s tales. 

Like the artist and critic John Ruskin, Pot- 
ter understood that the only way to know 
something was to draw it. First the hand- 
lens, then the camera, and finally the micro- 
scope taught Potter how to ‘see’ By her early 
30s, Potter’s enthusiasm was focused on how 
fungal spores reproduced — an issue that 
few British mycologists agreed on. During 
a holiday in Scotland in 1892, Potter had 
formed a botanical alliance with noted 
naturalist Charles McIntosh, who 
provided instruction in the micro- 
scope drawing of fungi in exchange 
for Potter’s accurate watercolours 
of rare specimens. By 1895 Potter ‘ 
had gathered young forms of the 
mushroom Boletus granula- 
tus, now known as Suillus 
granulatus, and drawn 
the spores and spore- 
producing structures, 
or basidia. Potter 
successfully ger- 
minated sporesof ya 
several species of g=—™ 
fungi,andmade ¥ 
drawings of the 
mycelium at differ- 
ent stages. 

She approached = 
the Royal Botanic 


< . 
Gardens at — 
— 


Kew with these 
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A Beatrix Potter study of insects, including a stag beetle (top left) and a bloody-nosed beetle (middle right). 


findings, only to be rebuffed by its director, 
William Thiselton-Dyer. She noted in her 
journal that “he hadn't the time to look at 
my drawings’, even though he “indicated 
the subject was profound”. Her uncle — the 
chemist Henry Enfield Roscoe — encour- 
aged her to continue her research, and in 
1897 she offered to the Linnean Society in 
London (which did not then admit women, 
or allow them to attend meetings) a paper: 
‘On the Germination of the Spores of Agari- 
cineae’, which was accompanied by several 
of her microscope drawings. Although this 
paper has been lost, it seems from her 
drawings and journal that Potter had 
become intrigued with the possibility 
of hybridization. 

Around the mid-1890s, Caroline 
Martineau, the principal of London’s Mor- 
ley Memorial College for Working 

Men and Women, commissioned 
Potter to produce a dozen 
lithographs to accom- 
pany lectures on ento- 
mology. Two survive; 
one shows a sheetweb 
spider, Linyphia trian- 
gularis. They are accurate, 
despite Potter’s recorded 
frustration with the 
errors in the Natural 
History Museum's 


Beatrix Potter asa 
teenager, with her 


¥ spaniel, Spot. 
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entomology index, and the misidentified 
specimens in the museums insect cases. 

Long after Potter had become the cel- 
ebrated author of The Tale of Peter Rabbit, 
The Tale of Two Bad Mice and more than 
20 other classic books for the young, she gave 
her prized mycological and botanical draw- 
ings to the Armitt Museum and Library in 
Ambleside in the Lake District. Today, they 
are still consulted by professional and ama- 
teur mycologists, and 59 of the drawings are 
reproduced in W. P. K. Findlay’s Wayside and 
Woodland Fungi (Warne, 1967). 

When her eyesight diminished, Potter 
turned to breeding prizewinning native 
Herdwick sheep, and to promoting the pres- 
ervation of the unique ecology and farming 
character of the Lake District. On her death 
in 1943, Potter, then Mrs William Heelis, 
bequeathed to the National Trust more than 
1,700 hectares of land, now enjoyed by thou- 
sands of visitors each year. 

In 1896, Potter summed up her delight 
in the natural world with a proclamation of 
supreme Victorian self-confidence: “With 
opportunity the world is very interesting.” 
The natural-history legacy of this shy but 
hugely curious and determined amateur 
continues to enlighten, and even dazzle. = 


Linda Lear is an environmental historian 
and biographer based in Washington DC. 
She is the author of Beatrix Potter: A Life in 
Nature (2007) and Rachel Carson: Witness 
for Nature (2009). 

e-mail: linda@lindalear.com 
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In the United States of the 1950s, blood donation was at the centre of a debate on the ethics of trading in body tissues. 


The commodified body 


Scott Carney assesses a study of banked human blood, sperm and milk. 


r | The US medical community’s view of 
the human body shifted in the twenti- 
eth century, when it began to be seen 

as a source of valuable tissues and fluids. 

In her thought-provoking Banking on the 

Body, Kara Swanson recounts how the use 

of human blood, semen and breast milk in 

medicine was made possible only by uniting 
interlocking and occasionally contradictory 
systems of body-part or product procure- 
ment and storage under the broad concept 
of ‘banking’ The financial metaphor set the 
stage for a showdown between competing 
ideas on how the body is exchanged as either 

a gift ora commodity. Although she focuses 

on fluids, Swanson argues that the same les- 

sons can be applied to all body parts, includ- 
ing kidneys or faces for transplantation, and 
human eggs. 

Swanson reveals how the body’s gradual 
commodification played out. She recounts 
how mothers in Boston, Massachusetts, were 
paid 60 cents a quart for their milk in 1929, 
but how demand for the fluid declined with 
the rise of formula milk. Blood was often 
more complex. At different hospitals, doc- 
tors either bought blood or made recipients 
promise to donate later. At times, the system 
of tissue exchange reflected contemporary 
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national dialogue. So 
in the Second World 
War, blood was a 
community resource 
donated freely to keep 
soldiers alive. During 
the cold war, however, 
creeping fear of com- 
munism made “free 4 
blood” unpatriotic — Banking onthe 

a foretaste of the way Body: The Market 
socialized medicine _ i" Blood, Milk, and 
: 3 Sperm in Modern 
is seen by many in the 


America 
United States today.In nna w swanson 
the 1960s, when for- Harvard University 
profit blood banks in Press: 2014. 


Kansas City, Missouri, 

sued Red Cross-style community blood 
banks, the arguments hinged on defin- 
ing whether blood was an interchangeable 
commodity or a “gift of life. The ‘currency’ 
of sperm banking has proved less fungible: 
sperm’s value fluctuates with the donor’s 
genetic profile, with geniuses commanding 
higher prices. 

In her close historical analysis, Swan- 
son, an intellectual-property lawyer, vaults 
over the profit-or-gift debate by show- 
ing that pragmatism reigns. In times of 
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shortage, physicians who support free 
blood exchange have routinely resorted to 
paying donors to bolster supplies. Blood 
becomes a commodity when selling it 
helps to build institutional profitability, as 
when the American Association of Blood 
Banks (AABB) lobbied to purchase blood 
in the 1950s. In the early 1960s, the AABB 
reversed its position when faced with the 
possibility of liability lawsuits after a patient 
received hepatitis-infected blood. Hospitals 
in New York argued that they were not sell- 
ing blood — only the medical services to 
transfer it. Distinctions between products 
and medical services became hopelessly 
tangled. 

So in today’s medical lingo, a person in 
need of blood in the United States cannot 
purchase a pint: rather, they rent the exper- 
tise of hospitals and surgeons to move the 
blood from one body to another. The blood 
is effectively free; the transfusion might end 
up costing thousands of dollars. The result 
is that everyone seems to be making money 
on body parts except for the donors. ‘Altru- 
istic donations are often deemed gifts only 
while they are being procured. Once alien- 
ated from the source, the commodified 
body parts can be sold as services between 
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different body banks and, ultimately, to 
consumers at high prices. 

Swanson’s solution is simple: scratch 
the doublespeak and start again. “Body 
products are property,’ she writes in her 
conclusion. “Markets in body products 
can be harnessed to serve communal 
goals. The professional donor can be a 
safe and respected supplier of body prod- 
ucts.” She makes clear that her aim is to 
redefine how all tissues — from blood to 
kidneys — are traded. What she does not 
quite articulate is that not all body parts 
are truly equal. 

Blood, milk and sperm — and of more 
recent interest, faeces — are the body’s 
renewables. The removal of a body part, 
however, entails considerably higher 
sacrifices. The danger, of course, is that 
as body products are commodified, 
so are people. Almost all the voices in 
her account are doctors, middlemen or 
regulators; donors and recipients are not 
represented. We do not hear from pris- 
oners who have been forced to sell their 
blood, or the many in Pakistan and India 
who are so desperate for cash that sell- 
ing a kidney illegally looks like a viable 
solution. 

More importantly, Swanson fails to 
explore the political economy of the 
body business. The sale of blood and 
blood derivatives worldwide generates an 
annual US$23.5 billion, and the United 
States is the single largest blood exporter 
in the world. The country also exports 
corneas, coagu- 
lants, bones and Everyone 
ligaments. Ameri- S@CIS to be 
cans, in turn, pur- making money 
chase kidneys, onbody parts 
hearts, eggs and except for the 
livers, and hire donors.” 
surrogate wombs 
in foreign markets. As the markets cross 
borders and profits grow, they render 
the donors invisible. Medical outcomes 
are important; so, too, is being sure that 
the donor is unharmed in the exchange. 
Looking to the banking structures of the 
past might not be the best way to create 
ethical supplies in the future. 

For scholars interested in the narrow 
confines of blood, milk and sperm banks 
in the United States, Swanson’s work is 
a comprehensive historical sweep. How- 
ever, without an economic analysis, 
her examination of banking metaphors 
offers an incomplete picture of a complex 
issue. m 


Scott Carney is the author of The Red 
Market (2011) and a senior fellow at 
the Schuster Institute for Investigative 
Journalism. 

www.scottcarney.com 
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Stratospheric folly 


Tim Kruger examines an argument against injecting 
aerosols into the atmosphere to counter climate change. 


s evidence of climate change piles up 
A= pessimism grows over climate 
negotiations, discussion of geo- 
engineering — deliberate large-scale inter- 
vention in Earth's natural systems — is rising 
up the agenda. In the succinct Can Science 
Fix Climate Change?, climate scientist Mike 
Hulme focuses on a proposal to cool Earth 
by injecting aerosols into the stratosphere to 
reflect solar radiation. He provides a lucid 
counterpoint to A Case For Climate Engineer- 
ing (MIT Press, 2013) by David Keith, a lead- 
ing proponent of research into this approach. 
Hulme believes that deploying strato- 
spheric aerosols is wrong-headed — unde- 
sirable, ungovernable and unreliable. He 
devotes a chapter to each. 

On undesirability, Hulme explains that 
although the approach would reduce global 
temperatures, it would also have significant 
side-effects, such as changing local rainfall 
patterns. But Hulme compares the down- 
sides with the climate of today, rather than 
with that of a climate-changed future. This 
is the equivalent of condemning a drug for 
having side-effects in healthy people before 
even considering whether the benefits would 
outweigh any side-effects in the ill. He thus 
avoids sullying himself with the considera- 
tion of what would constitute the lesser of 
two evils — a climate-changed world with- 
out stratospheric aerosols, or one with them. 

Hulme is on much stronger ground in look- 
ing at whether such approaches are govern- 
able. He argues that “If the deployment of the 
technology cannot conceivably be adequately 
governed, then the technology itself should 
not be researched”. He places the ball firmly 
in the court of those proposing research, urg- 
ing them to explain up front what regulation 
should consist of. He puts forward three mod- 
els —a multilateral United Nations process, a 
consortium-based regime or deployment by a 
single country — all of which he finds want- 
ing. Least unfavoured would be a multilateral 
approach, but Hulme questions whether this 
could deliver, citing the stalled international 
climate negotiations as evidence of imprac- 
ticality. A consortium — a coalition of the 
willing who agree on how to set the global 
thermostat — he condemns as too flimsy to 
govern a technology effectively for the requi- 
site decades. And he views deployment by a 
single country as deeply unsatisfactory. 

It is clear that comprehensive governance 
needs to be in place before deployment, but 
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is it necessary before 
research can be done? 
Hulme argues that it 
is, invoking a slippery- 
slope argument — that 


once research is started 

it will lead inexorably 

to deployment. But 

although the phenom- 

aie aca enon of technologi- 

Eucla and socilogiea 

Against Climate ock-in is frequently 

Engineering remarked on, so too 

MIKE HULME is the observation that 
Polity Press: 2014. most innovations fail. 

Research may show 


that stratospheric aerosols are far from a sat- 
isfactory solution. Hulme argues that their 
action is unreliable, reeling off a long list of 
known environmental, social, political and 
ethical issues, and anticipating others yet 
unknown. Hulme argues that this list is so 
overwhelming that it is not worth even start- 
ing such an exploration. 

Having rejected the case for this technol- 
ogy, Hulme concludes with an alternative 
plan for tackling climate change. He reiter- 
ates the agenda of The Hartwell Paper: A New 
Direction For Climate Policy After The Crash 
Of 2009 (LSE, 2010), calling for “reduc- 
ing weather risks; improving air quality; 
innovating in the search for cheap, reliable, 
clean energy”. And he emphasizes the moral 
character of the challenge we face: “It is not 
climate change that is the ultimate threat to 
human well-being. It is the lack of virtue.” 

The question left hanging is whether 
Hulme’s prescription will be adopted. Would 
that it were. 

One can embrace the Hartwell agenda 
while recognizing that even if it were to suc- 
ceed, it would lead to a warming world. And 
while we await a social tipping point — a 
global renaissance of humility, compassion 
and justice — we would be well-served by 
adopting those virtues in consideration of 
imperfect remedies. Addressing the knotty 
problems of climate change is more than 
an issue of virtue or wickedness. Humanity 
may yet find itself in the position of having 
to decide which option is the least worst. m 


Tim Kruger manages the Oxford 
Geoengineering Programme at the 
University of Oxford, UK. 

e-mail: tim.kruger@oxfordmartin.ox.ac.uk 
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US sanctions put 
research at risk 


Scientific collaboration during 
the cold war was one of the 

few links between the United 
States and the Soviet Union. It 
is sad and ironic that today’s 
partnerships may now bear the 
brunt of US sanctions as a result 
of the situation in Ukraine (see 
Nature 508, 162; 2014). 

Earlier this month, the FBI 
warned universities in Boston, 
Massachusetts, about venture- 
capital partnerships with Russia 
(see go.nature.com/oyxpth). The 
US government is also severing 
scientific exchange by NASA 
and the Department of Energy 
with scientists in Russia. These 
moves are causing widespread 
consternation in the Russian 
research community. 

Former Soviet and now Russian 
scientists have traditionally 
subscribed to democratic and 
pro-Western agendas (the nuclear 
physicist Andrei Sakharov is one 
famous example). Moreover, 
researchers, engineers and 
entrepreneurs of Russian origin 
have contributed significantly to 
the US enterprise. Indeed, after 
the collapse of the Soviet Union, 
the US government issued 
fast-track green cards to Russian 
scientists to retain their expertise. 

By contrast, the new 
US policies are likely to be 
counterproductive. The 
presumption that Russian 
scientists are potential spies will 
force them to return to or stay 
in Russia, where research is in 
disarray because of sweeping 
reforms. Researchers whose 
projects are derailed are likely to 
blame the United States rather 
than Russia — a situation that 
could further damage relations. 

There are signs that 
Russian scientists with foreign 
collaborators may come under 
more state control. It seems that 
both US and Russian bureaucrats 
lack understanding of the 
importance of international 
scientific collaborations and the 
way modern science works. 
Konstantin Severinov Rutgers 


University, New Jersey, USA; and 
Skolkovo Institute of Science and 
Technology, Russia. 
severik@waksman.rutgers.edu 


Ecology must seek 
universal principles 


I disagree with your advice to stop 
searching for universal principles 
in ecology (Nature 507, 139-140; 
2014). It is fortunate that Darwin 
held no such view. And in this era 
of big data, big theories are still 
needed — even in ecology. 

The “caveats and exceptions” 
in existing theory, which you 
imply scuttle efforts to build 
better theory, are the very 
foundation for improvement. 
This is how science progresses; 
ecology is no exception. 

You recommend that 
“ecologists should embrace the 
non-predictive side of their 
science’, but prediction is part of 
what distinguishes science from 
other worthwhile endeavours; 
it enables us to strengthen and 
refine our understanding. 

Using the wolves in 
Yellowstone National Park as an 
example to support your view is 
ironic. Top-down trophic control 
in multispecies populations is not 
a theory: it is an oversimplified 
conceptual model — a good story 
that is known to apply sometimes. 
It remains to be seen whether 
a unified theory of trophic, 
competitive and facilitative 
interactions among species can 
be achieved, but ecologists should 
not be advised to give up now. 
John Harte University of 
California, Berkeley, USA. 
jharte@berkeley.edu 


Mental health: more 
than neurobiology 


The decision by the US National 
Institute of Mental Health 
(NIMH) to fund only research 
into the neurobiological roots of 
mental disorders (Nature 507, 
288; 2014) presumes that these all 
result from brain abnormalities. 
But this is not the case for many 
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people with mental-health issues 
and we fear that this policy could 
stall clinical insight into mental 
illness for years to come. 

There is no consistent 
biological evidence to support 
the idea that all mental disorders 
are due to brain dysfunction. 
Mood and anxiety, for example, 
are multifactorial and depend 
on biological, psychological and 
environmental factors. 

The NIMH’s assumption of 
underlying neural mechanisms 
presupposes that symptoms 
of specific disorders cluster 
because they have the same 
biological cause. However, 
psychopathological symptoms 
of disorders such as depression 
differ dramatically in their 
causes and genetic context and 
do not have a common biological 
background (E. I. Fried et al. 
Psychol. Med. http://doi.org/ 
193; 2013). Symptoms are more 
likely to cluster because of causal 
connections — for example, 
insomnia and its side effects are 
widespread features of various 
psychiatric conditions. 

Eiko Fried, Francis Tuerlinckx 
University of Leuven, Belgium. 
eiko.fried@gmail.com 

Denny Borsboom University of 
Amsterdam, the Netherlands. 


Mental health: drug 
search on risky path 


I fear that the requirement by the 
US National Institute of Mental 
Health for investigators to come 
up with a hypothesis as an initial 
part of the drug-discovery 
process (Nature 507, 273 and 
288; 2014) will not end well. 

The discovery of all early and 
most recent drugs has hinged 
on testing the effects of small 
molecules on the phenotype 
of cells and organisms 
(M. J. Keiser et al. Biochemistry 
49, 10267-10276; 2010). A 
hypothesis is not needed 
(D. B. Kell and S. G. Oliver 
BioEssays 26, 99-105; 2004) and 
finding a drug's likely targets 
can be postponed until it is 
known whether the drug works. 


© 2014 Macmillan Publishers Limited. All rights reserved 


It is now more fashionable to 
start the drug-discovery process 
with a hypothesis on how a drug 
might act on (typically) a single 
target, then to test how effective 
it is. But if the hypothesis turns 
out to be wrong, which is not 
unusual (D. B. Kell FEBS J. 280, 
5957-5980; 2013), money and 
effort will have been wasted. 
Douglas Kell University of 
Manchester, UK. 
dbk@manchester.ac.uk 


Restore public trust 
in care.data project 


Public confidence has already 
been undermined in the National 
Health Service's care.data 
programme in England, which 
will integrate medical records 
into a database for health-care 
and research purposes (Nature 
507, 7; 2014). We suggest that 
the UK government's decision to 
limit release of the data should be 
the start ofa process to develop 
public trust in large-scale health 
research, not an end to it. 

Care.data first needs to provide 
more information to the public 
about how medical records will 
be used. Openness and clarity 
are essential. For example, the 
UK Health and Social Care 
Information Centre (which 
manages the data) can currently 
share pseudonymized data with 
‘such persons and in such form 
and manner ...as it considers 
appropriate’: these choices should 
be fully explained to the public. 

A proposed amendment to the 
Health and Social Care Act 2012 
might narrow this discretion so 
that data can be shared only for 
‘the promotion of health’. But even 
this is open to wide interpretation. 

Formal regulation is only 
one way to reassure the public. 
Another is to use online interfaces 
that allow research participants 
to access information and to set 
consent preferences (J. Kaye et al. 
Eur. J. Hum. Genet., in the press). 
Colin Mitchell, Linda Briceno 
Moraia, Jane Kaye University of 
Oxford, UK. 
jane. kaye@law.ox.ac.uk 
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Catalysis at the boundaries 


Copper-based materials have been found that efficiently convert carbon monoxide and water to ethanol using electricity. 
The discovery is a major advance towards storing renewable energy in the form of a liquid fuel. SEE LETTER P.504 


AARON M. APPEL 


remarkable improvement in a catalyst 
A the electrochemical production 

of carbon-based fuels from carbon 
monoxide and water is reported by Li et al.’ 
on page 504 of this issue. Although electrodes 
made from copper have been known for many 
years to catalyse the transformation of carbon 
dioxide to chemicals and fuels’, through inter- 
mediates such as CO, the authors have dem- 
onstrated that the method used to prepare 
copper electrodes has a substantial effect on 
the activity and energy efficiency of these cata- 
lysts. This result may lead to improvements in 
how renewable energy sources are exploited. 

Renewable sources such as sunlight and 
wind provide an opportunity to power society 
without the negative consequences that accom- 
pany the use of fossil fuels, but their intermit- 
tent availability is a major limitation. Efficient 
means of storing energy from these sources 
must therefore be found in order to facilitate 
their widespread use’. An attractive approach 
is the use of electricity to drive the production 
of fuels from water and CO, (refs 3-6; Fig. 1). 
Unlike conventional, centralized fuel produc- 
tion, electrochemical systems can operate at 
mild pressures and temperatures in small-scale 
reactors — making them ideal for producing 
fuels at sites of renewable energy sources, which 
are inherently dispersed. However, the develop- 
ment of efficient catalysts will be essential for 
the intended chemical transformations. 

The electrochemical conversion of CO, 
into fuels is a multi-step process that has 
many potential products and intermediates. 
The choice of catalyst has a profound effect 
on the selectivity and energy efficiency of this 
process. The first product is often CO: this gas 
has a low energy density, so further transfor- 
mation is required to make an effective fuel. 
Although copper catalysts are known to work 
for the initial reduction of CO, (refs 2, 7), Li 
and colleagues focused on the subsequent 
transformation of CO. They report that, 
when copper is oxidized and then reduced back 
to copper in a controlled way, the resulting 
catalytic metal surface not only produces 
ethanol more selectively than previous cata- 
lysts, but is also more energy efficient. This dis- 
covery is a great step towards the cost-effective 
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Figure 1 | Storing energy from renewable sources. Renewable energy sources, such as solar energy, 
are only intermittently available, so a means of storing the energy is needed. If the energy is converted 
into electricity, this could be used to drive electrochemical reactions that convert carbon dioxide into 
liquid fuel, often with carbon monoxide (not shown) as an intermediate; when the fuel is later used to 
do work, CO, is released, and can be recycled into fuel. Li et al.’ report improved copper catalysts for the 
electrochemical conversion of CO to ethanol, a potential liquid fuel. 


production of renewable liquid fuels with high 
energy densities. 

One of the long-standing challenges in 
heterogeneous catalysis, in which reactions 
typically occur at the surface of a material, is 
to understand the causes of changes in cata- 
lytic performance. The surface of a material 
usually bears a variety of structural features, 
which are potential catalytic sites for the key 
reactions. Identifying which of these is respon- 
sible for catalysis requires the characterization 
of materials by numerous techniques, followed 
by attempts to correlate changes in surface fea- 
tures with changes in catalytic performance. 

Li et al. analysed the copper catalysts pro- 
duced by their oxidation-reduction process, 
and found that the size of the particles formed 
does not seem to explain their catalytic activ- 
ity — similarly sized copper particles prepared 
using a different method did not have similar 
catalytic activity. The researchers propose that 
the different activities of the catalysts produced 
using the two methods derive from grain 
boundaries, the junctions between crystals 
within the particles. Although particles pre- 
pared using different methods may be similar 
in size, their grain boundaries vary substan- 
tially. These multidimensional ‘defects’ may be 
the key to the significant enhancements seen 
in selectivity and energy efficiency. 

The insight gained from Li and co-work- 
ers’ results lays the groundwork for further 
advances. As with all electrocatalysts, the 
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energy efficiency of ethanol formation in the 
authors’ system could be improved by decreas- 
ing the electric potential required for reaction 
and by increasing the product selectivity. But 
a more remarkable advance may come from 
studying the carbon-carbon bond-forming 
process that connects CO molecules to form 
two-carbon molecules (such as ethanol). The 
selective formation of carbon-carbon bonds 
is a key step in a variety of catalytic transfor- 
mations, not only for industrial applications, 
but also in biological systems. Enzymes con- 
tain precisely positioned chemical groups that 
control bond-forming and bond-breaking 
reactions’, and the variations in activity and 
product selectivity observed by Li et al. could 
be due to the formation of similarly multi’ 
functional catalyst structures. An understand- 
ing of this process could lead to the production 
of fuels that contain more carbon atoms (such 
as butanol), which have an even greater energy 
density than ethanol. 

Liquid fuels produced from CO, and its 
derivative CO face considerable challenges in 
the fuel market. One of the biggest problems 
is that fossil fuels have an inherent advantage, 
because energy was stored in them — by pre- 
historic photosynthesis — at no cost. By con- 
trast, the energy used to produce fuels from 
renewable sources must be paid for. At present, 
this expense is substantial, but the cost rela- 
tive to that of fossil fuels will probably decrease 
with time. Because of the challenges of 
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predicting fossil-fuel prices, renewable-energy 
production costs, and incentives (such as car- 
bon taxes) to use renewables in place of fossil 
fuels, the timetable for widespread adoption of 
renewable fuels is not clear. But the production 
of fuels from non-fossil sources will certainly 
require effective catalysts. Li and colleagues’ 
work is an excellent step in that direction. = 


Aaron M. Appel is in the Catalysis Science 


SENSORY SYSTEMS 


Group, Pacific Northwest National Laboratory, 
Richland, Washington 99352, USA. 
e-mail: aaron.appel@pnnl.gov 
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Do you hear what I see? 


Researchers have found evidence that the representation of auditory and tactile 
information in the brains of blind people shows strong similarities to the way in 
which visual information is represented in sighted people. 


IONE FINE 


Imost one-quarter of the brain is 
Av devoted to processing visual 

information: reading text, recogniz- 
ing faces, following the Sunday match, and 
much more. The brain’s visual cortex contains 
specialized regions devoted to process- 
ing motion, text, faces, scenes, objects 
and even the position and movement of 
bodies. In congenitally blind individu- 
als, much of the ‘visual’ cortex responds 
strongly to auditory and tactile input 
rather than to visual stimuli, a phenom- 
enon known as cross-modal plasticity. 
Writing in Current Biology, Striem-Amit 
and Amedi' shed light on how these 
cross-modal responses are organized. 

The researchers trained congeni- 
tally blind individuals for 70 hours in 
the use of a technology’ that converts 
visual information into ‘soundscapes’ 
(called vOICe). After an initial click, 
an image frame is scanned from left to 
right and then represented by a sequence 
of ‘chords: Brightness is represented by 
loudness, height in the scene is repre- 
sented by pitch and horizontal position 
is represented by the time since the click. 
For example, a diagonal line stretching 
upward from left to right becomes the 
sound of an ascending sweep. 

The authors used functional magnetic 
resonance imaging to measure the brain 
responses of participants while they 
listened to soundscapes of body-shape 
silhouettes, objects, faces and abstract 
patterns. These brain responses were 
compared with those of sighted individ- 
uals viewing the corresponding visual 
images. Both blind and sighted subjects 
showed larger responses to body-shape 
silhouettes than to other images within 


a region of the brain known as the extrastriate 
body area (EBA), which, in individuals with 
normal sight, is involved in visual perception 
of the human body. 

One of the most interesting conclusions 
that Striem-Amit and Amedi draw from their 
results is that the specialization for body-shape 


Figure 1 | Using sound instead of sight. In goalball, a sport 
designed for blind athletes, players use sound to determine the 
position of both the ball and the other players. Striem-Amit 
and Amedi’ report that auditory stimuli are processed in blind 
people by a region of the brain that is associated with vision in 
sighted individuals. 
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information in the EBA is innate, requiring 
little experience to develop, because selectivity 
for body-shape silhouettes in this brain area 
occurs even in blind individuals with little or 
no experience of body shape before training 
with vOICe. Thus, the authors interpret their 
findings as evidence that the specialization 
found in the visual cortex may exist indepen- 
dently of experience. 

This claim is based on the assumption that 
congenitally blind individuals have a deeply 
impoverished experience of others’ body pos- 
tures, actions and movements. Indeed, the 
authors claim that this information is avail- 
able to blind individuals solely through touch, 
and that they have no auditory experience of 
body shape before vOICe training. However, 
in the course of our research, my laboratory 
has noticed that many visually impaired 
individuals report knowing a surprising 
amount about body positions, actions 
and intents from sound alone. Visually 
impaired subject Mike May reports: “I 
know when people are getting bored by 
my presentation because I can hear them 
shuffling in their chairs.” A fully blind col- 
league, Nick Giuduce, admits: “When I'm 
interested in a woman, I can tell if she’s 
slender by how much the floor creaks 
as she walks by.’ Amy Burk, the Cana- 
dian goalball player shown on the right 
in Figure 1, reports: “I can see it all ... 
Ican visualize it all ... if they are moving 
with the ball ’'m moving with them ... 
whatever side they are going on I’m 
following the person.” It is reasonable to 
suggest that the EBA has a role in inter- 
preting these auditory experiences in 
blind individuals. 

Regardless of whether auditory 
experience before training with vVOICe 
involves the EBA, this study adds to a 
body of work suggesting that ‘functional 
constancy’ is one of the organizational 
principles underlying cross-modal 
plasticity in blind people*. According to 
this principle, specialized brain regions 
continue to serve the same function in 
blind individuals, but there is a shift in 
the regions’ primary sensory input from 
sight to hearing or touch. A variety of 
studies have provided evidence for func- 
tional constancy in blind individuals. 
These showed that the visual-motion 
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area of the brain responds to auditory and 
tactile motion**, that sounds made by objects 
are represented in brain regions associated 
with visual-object recognition”, and that read- 
ing Braille elicits brain responses in the visual 
word-form area’. 

There is an appealing elegance to the 
idea that the representation of auditory and 
tactile information in the ‘visual’ cortex of 
blind individuals is analogous to that of sighted 
people. But the extent of the apparent similar- 
ity may be partly a consequence of hypoth- 
eses being tested from a sighted perspective. 
Experiments published so far have relied on 
measuring brain responses to auditory stim- 
uli that have been experimentally selected 
and categorized on the basis of what makes 
intuitive sense to us as sighted scientists. This 
may result in an overemphasis of apparent 
similarities. 


INFECTION BIOLOGY 


More-detailed examination of the response 
profiles of seemingly analogous areas of the 
brain using a wider variety of stimuli may yet 
reveal important differences between blind 
and sighted individuals, both in the way that 
information is represented and in the function 
of these areas with respect to perception and 
behaviour. Indeed, in their discussion, Striem- 
Amit and Amedi note that the representation 
of bodies is not identical in blind and sighted 
individuals. A second region of the brain that 
seems to represent body shape, the fusiform 
body area (which may contain a more holi- 
stic representation of body position), showed 
reduced selectivity to body-shape information 
in the blind individuals they studied. Moreover, 
the intriguing possibility that blind people have 
specialized regions that contain representations 
of the world with no sighted equivalent remains 
almost entirely unexplored. m 


Nibbled to death 


Trogocytosis, a process in which one cell ‘takes a bite’ out of another, had previously 
been seen only in immune cells. But the phenomenon has now been found in 
Entamoeba histolytica, as a way for this parasite to kill host cells. SEE LETTER P.526 


NANCY GUILLEN 


arly in an immune response, antigen 
eer are captured by antigen- 

presenting cells and displayed on the 
cells’ surface, where they are recognized 
by receptors on the surface of lymphocytes 
(B cells, T cells and natural killer cells)'”. 
The lymphocyte then becomes joined to the 
antigen-presenting cell and extracts these sur- 
face molecules along with patches of the cell 
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Figure 1 | Amoebic trogocytosis. Ralston et al.’ have described trogocytosis 
by Entamoeba histolytica, in which the parasites tear off and ingest patches 

of host cells, resulting in their death. The interaction between the cells is 
mediated by abundant surface molecules, including glycoproteins and their 
attached carbohydrate chains. The cell-to-cell contact is then stabilized 
through adhesion molecules (not shown), and cytoskeletal activity in the 
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membrane. This process, termed trogocyto- 
sis (from the Greek word trogo, to nibble)”, 
activates the lymphocyte to initiate a specific 
immune response to that antigen. Until now, 
trogocytosis had been observed only between 
immune cells. But in this issue, Ralston et al.* 
(page 526) describe a form of trogocytosis 
carried out by the parasite Entamoeba histo- 
lytica, and suggest that this process mediates 
the destruction of intestinal cells that is seen 
in amoebiasis — the gastrointestinal infection 


Ingestion 


Src, Syk, PI3K 


Priming 
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caused by these unicellular organisms. 
Trogocytosis in immune cells requires the 
transduction of signals from the acceptor-cell 
surface by means of kinase enzymes such as 
Src, Syk and PI3K, and by modulation of the 
cell’s cytoskeleton (which is rich in the pro- 
tein actin) and of intracellular calcium-ion 
(Ca”*) levels”. It is a rapid process, occurring 
within minutes of co-culturing the participant 
cells in vitro”. Notably, despite the exchange of 
material, neither the antigen-presenting cell 
nor the lymphocyte dies following trogocytosis. 
In the first description of the process’, it was 
proposed that trogocytosis may have evolved 
as a way for cells to acquire nourishment from 
other cells, and later as a means of intercellular 
communication. The hypothesis of an ancient 
origin for trogocytosis is now supported by 
Ralston and colleagues’ observation ofa simi- 
lar mechanism in E. histolytica, an ancient 
organism. However, unexpectedly, this form of 
trogocytosis enhances the parasite’s virulence, 


Amplified Cell death 


ingestion 


parasite is probably involved in generating the force required to pinch off 

the host-cell membrane. This transfer of cellular material causes changes in 
intracellular calcium-ion levels and triggers signalling pathways in both cells. 
The activity of Src, Syk and PI3K enzymes contributes to this process and 
leads to ‘priming’ of the parasites, which amplifies subsequent ingestion. 
After successive nibbling events, the host cell dies and the parasite moves on. 


and results in killing of the target cells. 

During amoebiasis, E. histolytica resides in 
the colon of an infected individual, where it 
depletes it of mucus, interacts with the exposed 
enterocyte cells lining the colon, dismantles 
the junctions between them and causes their 
death by lysis. The previous model’ for the 
parasite’s action was that it attaches itself 
to host cells and kills them, by an as-yet 
unclear mechanism involving the insertion of 
‘amoebapore peptides into the cell membrane 
and subsequent lysis. It was also thought that 
the parasites engulf and ingest dying entero- 
cytes by phagocytosis. But Ralston et al. instead 
show that the parasite ingests pieces of the host 
cell, and that this nibbling occurs in a repeated 
manner that ends up killing the cell. Once kill- 
ing is achieved, the amoebae move on (Fig. 1). 

This trogocytosis-like process is fundamen- 
tally different from amoebic phagocytosis, in 
which the parasite entirely ingests cells such as 
red blood cells (including dead cells). Although 
it is not known what determines the parasite’s 
choice between phagocytosis and trogocytosis, 
some of the authors’ findings point to mecha- 
nisms similar to those suggested in immune 
cells’ that can perform trogocytosis in parallel 
with phagocytosis. For instance, both processes 
require an active actin-rich cytoskeleton and 
associated signalling pathways. 

Interestingly, the authors’ data show that 
parasites that have previously performed 
trogocytosis are ‘primed’ for higher ingestion 
activity and elicit more host-cell killing than 
parasites that have not, demonstrating that 
trogocytosis changes the parasites’ behaviour. 
The authors have also produced images of 
trogocytosis in living tissue for the first time, 
and show that the process is required for the 
parasites to invade the tissue, leading to the 
pathogenic consequences for the host. They 
further show that the parasites can trogocytose 
all of the cell types tested, including entero- 
cytes, lymphocytes, intestinal-tissue cells 
and red blood cells; because these cells have 
different surface constituents, it seems likely 
that a ubiquitous interaction occurs at the cell 
surfaces during trogocytosis. 

It can thus be argued that the first step in 
amoebic trogocytosis — cell-to-cell attach- 
ment — is mediated by general components 
of the cell surface (for example, the glycocalyx, 
which is rich in glycoproteins and glycolipids), 
rather than by specific cell receptors such as 
the T-cell receptor (TCR), as is observed in 
immune-cell trogocytosis. The most relevant 
glycosylated (carbohydrate-containing) com- 
ponents on the parasite cell surface, in terms 
of abundance, are lipopeptidophosphoglycans 
and Gal/GalNAc lectin’. Glycosylated residues 
on these amoebic components might become 
attached to glycosylated components on the 
donor cells. The data obtained by Ralston et al. 
support this hypothesis, because Gal/GalNAc 
lectin is essential for the process. 

Despite their respective cell specificity and 


ubiquity, there are interesting correlations 
between the TCR and Gal/GalNAc lectin in 
terms of signal-transduction characteristics: 
antigens bind to the TCR with relatively low 
affinity, similar to the predicted weak inter- 
actions between glycosylated residues and 
the Gal/GalNAc lectin; the activation of both 
intracellular signalling pathways involves the 
dynamic linking of the cell-surface molecules 
into microclusters*” and requires a short cyto- 
plasmic domain; and both pathways are asso- 
ciated with Src-kinase activity in the acceptor 
cell. However, these are only correlated 
features, because there is no evidence for 
structural homology between the TCR and 
Gal/GalNAc lectin. 

It is also interesting to speculate on the 
significance of the intensification of amoebic 
trogocytosis in primed parasites. One possible 
explanation is that surface components of the 
host cell activate specific signalling pathways 
that enhance the parasite’s affinity for extra- 
cellular carbohydrates. This idea is based on 
recent findings indicating that carbohydrate 
metabolism is involved in amoebic patho- 
genesis’. Are these signals necessary for 
killing donor cells by amoebic trogocytosis, 
but not for lymphocytic trogocytosis? Com- 
prehensive comparisons of gene expression 
between primed and naive parasites (and in 
lymphocytes) might help to explain the roles 
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of these activated signals. Further studies may 
also reveal whether amoebic trogocytosis and 
phagocytosis occur simultaneously, and what 
specific signals result from each process. But 
although such details remain to be determined, 
this new concept of amoebic trogocytosis 
is important for understanding not only 
amoebiasis, but also host-pathogen inter- 
actions in other systems and in immune-cell 
function and interactions. = 
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Y chromosome 


Comparisons of Y-chromosome sequences in various mammals reveal abundant 
gene loss early in the chromosome’s evolution but remarkable gene stability 
across the Y chromosomes of extant species. SEE ARTICLES P.488 & P.494 


ANDREW G. CLARK 


he evolution of the mammalian X and 

Y sex-determining chromosomes from 

ancestral chromosomes is thought 
to have occurred through a rapid loss of 
genes from the Y chromosome. This idea of 
rapid degeneration’ has been bolstered by 
observations made during the emergence of 
new Y chromosomes or Y chromosome seg- 
ments in fruit flies”. In this issue, Bellott et al.? 
(page 494) and Cortez et al.* (page 488) pre- 
sent extensive accounts of gene evolution on 
the Y chromosome. They show that, although 
there was a period of rapid degeneration and 
gene loss during its early evolution, the genes 
that are conserved across the Y chromosomes 
of extant mammals (and the sex-determining 
W chromosomes of birds) have since been 
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remarkably stable. The researchers’ data also 
provide a detailed picture of the evolution- 
ary forces acting on the sex chromosomes, 
and offer a plausible explanation for the func- 
tional coherence of Y-linked genes across 
these species. 

The Y chromosome is notoriously chal- 
lenging to study, in terms of both genetics and 
molecular biology. Despite the fact that male 
genomes were included in early whole-genome 
sequencing projects, the Y chromosome was 
largely ignored owing to the challenge of 
obtaining useful data from the chromosome, 
which is rich in repetitive and palindromic 
sequences. Bellott et al. adopted a previously 
described approach’ — cloning regions of 
the DNA of interest into bacterial artificial 
chromosomes — to obtain and assemble 
DNA sequences from the Y chromosomes 
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50 Years Ago 


‘Extracorporeal perfusion of the 
isolated head of a dog’ — Critical 
evaluation of cerebral metabolism 
and intracranial fluid distribution 
necessitates complete isolation of 
the brain's blood supply; however, 
brain viability must be demonstrated 
and maintained for such studies 

to be meaningful ... In order to 
minimize handling of the brain 
substance, a factor which may 
disturb fluid distribution and 
cerebral metabolism, we have 
chosen to leave the brain within 
the skull during perfusion ... Most 
cortical activity ceased when blood 
glucose was depleted ... Even after 
the electrocortical activity ceased, 
corneal and lid reflexes remained 
intact and the oxygen and glucose 
consumption continued ... From 
our experience, we believe that 
electrocortical activity is a sensitive 
index of brain viability, in that it 

is lost long before inactivation of 
corneal and lid reflexes or cessation 
of metabolism. In this preparation we 
have demonstrated that the dog brain 
maintains this activity for several 
hours after complete decapitation. 
From Nature 25 April 1964 


100 Years Ago 


The second reading of a Bill to 
prohibit experiments on dogs was 
carried in the House of Commons 
on Friday last, April 17, by a majority 
of forty-two ... It was stated on 
behalf of the Government that 

an amendment will be moved 

in Committee to abolish the 
proposed prohibition and to allow 
experiments only in cases where no 
other animal but a dog is available 
for the purpose ... Before the second 
reading was taken, a memorial 
signed by more than three hundred 
eminent physicians, surgeons, 

and other scientific investigators, 
protesting against the measure, was 
addressed to the Home Secretary. 
From Nature 23 April 1914 
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of four placental mammals (rat, mouse, bull 
and marmoset) and the marsupial opossum. 
They compared these with existing sequences 
for another three placentals (rhesus macaque, 
chimpanzee and human). Of the 184 genes that 
the authors infer to have been on the ances- 
tral sex chromosomes some 300 million years 
ago, they find that only 3% survive on the 
Y chromosome of one or more of these 
mammals (Fig. 1). 

Consistent with previous reports, this means 
that massive degeneration and gene loss did 
occur early in the history of the mammalian 
Y chromosome. However, once the genes had 
run this gauntlet, those that remained enjoyed 
remarkable stability on the Y chromosome. 
The authors also find that the 36 genes that are 
present on both the X and the Y chromosomes 
of all eight species they examined have main- 
tained a stable presence for the past 25 million 
years. Ten genes were found to be shared across 
the Y chromosomes of the tammar wallaby, the 
Tasmanian devil and the opossum, indicating a 
stable Y-chromosome presence for the 78 mil- 
lion years of the marsupial lineage. These 
findings have important implications for our 
understanding of how natural selection acts to 
retain active functioning of specific subsets of 
genes on the Y chromosome. 

Cortez et al. took a faster survey approach, 
in which they sought RNA molecules that are 
expressed in males but not females and then 
verified that the genes encoding these RNAs 
are found only in male genomic DNA. This 
allowed them to identify 134 genes transcribed 
from the Y chromosome across 10 mammals 
and to follow their evolutionary fates. By 
including the chicken (in which males have 


two Z chromosomes and females have one Z 
and one W chromosome) and the platypus 
(a monotreme that has a bizarre array of five 
X and five Y chromosomes), the authors were 
able to paint a broader picture of sex-chro- 
mosome evolution. Most noteworthy is their 
observation that the sex chromosomes of 
placental mammals, birds and monotremes 
had essentially independent origins, which 
means that patterns of gene loss and of specific 
retention of classes of genes on their Y (or W) 
chromosomes can be compared. 

These data add depth and confidence to the 
model of evolutionary ‘strata’ on the sex chro- 
mosomes’ that mark the time points at which 
X and Y sequences ceased recombining and 
subsequently diverged. Intriguingly, despite 
their independent origins, the authors find 
that the oldest strata in placental mammals, 
monotremes and birds are remarkably similar 
in age, estimated to have occurred 181 mil- 
lion, 175 million and 137 million years ago, 
respectively. 

Another key aspect of genes on sex chromo- 
somes is dosage sensitivity. Dosage-insensitive 
genes are those that function perfectly well 
when present as a single copy, and these are 
especially likely to become X- or Y-specific. By 
contrast, two copies of dosage-sensitive genes 
are required for normal health, and such genes 
are likely to be retained on both the X and the 
Y chromosome’. Genes involved in regulating 
gene transcription — such as those that encode 
transcription factors — commonly function 
inadequately in only a single dose, provid- 
ing a hypothesis for why the Y chromosome 
has retained genes involved in transcription 
regulation. 


Figure 1 | Small but stable. The human Y chromosome (right) is much smaller than the X chromosome 
(left), as a result of extensive degeneration early in Y-chromosome evolution. However, comparisons with 
other mammalian Y chromosomes by Bellott et al.’ and Cortez et al.* show that there has been remarkable 
gene stability across Y chromosomes following this initial gene loss. 
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POWER AND SYRED/SPL 


Because the Y chromosome is enriched with 
transcription-regulating genes, this means that 
it is far from being solely a male-determining 
switch that is flipped early in development. 
Instead, the Y chromosome has an impact on 
gene regulation across the genome in males, 
potentially influencing biological functions 
throughout life and in every tissue. It is fair 
to say that we are only beginning to under- 
stand the full extent of the differences in the 
molecular biology of males and females, and 
unanswered questions abound. For example, 
to what extent are male-female differences 
driven by specific interactions with Y-chromo- 
somal factors? 

In humans, the level of variation between 


CLIMATE SCIENCE 


individuals is considerably lower on the 
Y chromosome than on other chromosomes. 
However, Y-linked sequence changes can cause 
changes in gene expression across the genome, 
which could result in amplified differences 
among males. Despite the relative stability of 
the gene content on mature Y chromosomes, 
it is well known that DNA sequences evolve 
faster on the Y chromosome than on the X. 
Although this is generally perceived to be the 
result of the arrest of genetic recombination on 
the Y chromosome leading to reduced effec- 
tiveness of natural selection’, it seems that the 
Y chromosome also has the potential to medi- 
ate remarkably rapid adaptive evolutionary 
change. m 


Sea levels from 
ancient seashells 


The isotopic composition of oxygen in sea water correlates with changes in global 
mean sea level. Microfossils carrying oxygen-isotope signals have been used to 
extend sea-level records as far back as 5 million years ago. SEE ARTICLE P.477 


RALPH SCHNEIDER 


n page 477 of this issue, Rohling et al.' 

present a convincing approach for 

calculating sea-level fluctuations over 
the past 5 million years (Myr). Their method 
depends on variations in the oxygen-isotope 
composition of shells produced by unicellu- 
lar organisms called planktonic foraminifera 
(Fig. 1). In this way, the authors provide much- 
needed information that should help to pre- 
dict future rates of sea-level rise in the event 
of complete or partial melting of the ice caps 
over Greenland and Antarctica in response to 
global warming. 

The most recent instances of ice-sheet 
growth and melting that generated sea-level 
variation of several metres took longer than 
hundreds of years, making it impossible to 
determine their effects directly from histori- 
cal records. To infer how growing and melt- 
ing continental ice sheets affect sea level, at 
least the past 500,000 years must be consid- 
ered. During this time, there were five periods 
of sea-level rise of up to 100m or more, 
corresponding to the terminations of recent 
glacial periods (the intervals of time within 
the current, ongoing ice age that, in general, 
correspond to colder temperatures and glacier 
advances). 

One could argue, however, that these termi- 
nations do not adequately describe what would 
happen during complete melting of the Green- 
land ice cap and parts of the West Antarctic 


Ice Sheet. Hence, it is much more appropriate 
to consider past conditions when Northern 
Hemisphere glaciation was still young — that 
is, when small, juvenile continental ice caps 
controlled sea-level fluctuations on time- 
scales of tens to hundreds of thousands of 
years (the timescales associated with variations 
in Earth’s orbit, which dominate the timing 
of glacial-interglacial cycles and sea-level 
changes). But how can this be done? The shells 
of foraminifera offer a potential solution. Made 
of calcium carbonate, they contain a record of 
the ambient isotopic composition of seawater 
oxygen during the organisms’ lifetime. 

Since the pioneering work’ of the geologist 
Cesare Emiliani in the 1950s, it has commonly 
been accepted that periodic variations in the 
ratios of oxygen-18 to oxygen-16 ('*O/'°O) 
in foraminifera preserved in deep-sea sedi- 
ments follow a global pattern characteristic 
of orbitally forced climate change during 
the Late Pleistocene epoch (about 700,000 
to 11,700 years ago). Cool temperatures and 
great ice volumes both resulted in high '*O/'SO 
ratios, whereas high temperatures and low ice 
volumes had the opposite effect. So, if the 
temperature effect can be disentangled from 
this isotopic record, then the remaining signal 
represents relative changes in continental ice 
volume. And if this signal can then be scaled to 
the amplitude of sea-level rise between glacial 
and interglacial periods (as has been done for 
the most recent postglacial period by correlat- 
ing sea-level rises to the oxygen-isotope com- 
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position of cores taken from coral terraces’), 
then fluctuations in global sea level over time 
can be calculated, as long as ocean sediments 
provide continuous, undisturbed '*O/'°O 
records. 

Going back even further in time, a 5-Myr- 
long composite record’ of °O/’°O ratios from 
foraminifera that lived in deep-sea sediments 
was until now considered the best chronicle of 
ice-sheet volume as Earth shifted from a hot, 
‘greenhouse’ climate (about 55 Myr ago) to 
colder, ‘icehouse’ conditions (approximately 
2.6 Myr ago). However, there are two pro- 
blems with this record. First, the proportions 
of the temperature and ice-volume effects in 
it are unclear, because deep-ocean temperatu- 
res may have changed substantially over this 
long period of climate transition. Second, it 
is difficult to scale deep-sea oxygen-isotope 
variations at orbital timescales into robust 
estimates (including error margins) for the 
amplitude of sea-level rise and fall over the 
past 5 Myr. 

Rohling et al. overcome these problems by 
converting '*O/"°O ratios of fossilized plank- 
tonic foraminifera that proliferated in the 
surface waters of the eastern Mediterranean 
Sea directly into sea-level variations — an 
approach previously developed for a study” 
of the Red Sea, and which does not require 
temperature and ice-volume effects to be dis- 
entangled first. Their method depends on a 
hydraulic model of water exchange through the 
Strait of Gibraltar, which connects the North 
Atlantic Ocean and the Mediterranean Sea. 
This exchange mechanism not only controls 
the balance of evaporation and water renewal 
in the Mediterranean, but also strongly affects 
the seawater oxygen-isotope ratios recorded in 
planktonic foraminifera. 

Assuming that there have been no major 
tectonic movements in the Strait of Gibraltar 
during the past 5 Myr that affect its depth and 
width, the oxygen-isotope signal from these 
foraminifera is simply a function of global sea- 
level variations relative to the modern hydrau- 
lic state of the Mediterranean Sea. The authors 
find that estimates of ancient sea levels relative 
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Figure 1 | Isotopic recorders. Planktonic foraminifera, such as Globigerinoides ruber (pictured), are 

unicellular organisms that live near the sea surface. Rohling et al.’ have used oxygen-isotope data from 
microfossils of planktonic foraminifera from the Mediterranean Sea to deduce global sea levels during 
the past 5.3 million years. 


to modern levels made using their approach 
are quantitatively similar to those obtained 
using other approaches, but have much better- 
defined error statistics. Accordingly, they con- 
firm that sea level was about 10-30 m higher 
in the warm period that preceded the current 
ice age than it was when the ice sheets sub- 
sequently expanded. 

As Rohling and co-workers acknowlege, 
their method has several limitations that 
require further study and improvement. 
Apparent mismatches exist between their 
results and those of hydraulic models applied 
to the Red Sea’, particularly for periods 
corresponding to the maximum volume 
of ice, towards the end of three of the four 
most recent glacial periods. This could be 
because the researchers’ model for converting 
planktonic oxygen-isotope data to sea levels 
might not fully account for freshwater input 
from rivers, or for perturbations in the Oxy- 
gen-isotope signal of precipitation and rivers 
over time. Nonetheless, the striking similar- 
ity between Rohling and colleagues’ results 
and those from two independent studies that 
used data from the Red Sea’ and the south- 
west Pacific Ocean’ to predict sea levels during 
the past 0.5 and 1.5 Myr, respectively, strongly 
suggest that the researchers’ conclusions for 
older periods are correct. 

As expected for any new approach, the 
findings leave several open questions. The 
conclusion that the first major sea-level fall 
took place more than 0.5 Myr after the onset 
of significant global cooling 2.75 Myr ago 
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challenges commonly held assumptions 
(including some of my own’) inferred from 
the composite record’ of '°O/'°O ratios from 
foraminifera that lived in deep-sea sediments. 


REPRODUCTIVE BIOLOGY 


But it receives a lot of support from other 
studies of regional palaeoclimatic trends for 
the period approaching the ice age in the 
Pleistocene epoch (roughly 2.6 million to 
11,700 years ago; see Extended Data Table 2 
of the paper’). 

Rohling et al. also conclude that the ice 
volume during the Last Glacial Maximum — 
the peak of the most recent glacial period — 
was larger than that during previous glacial 
periods. This contradicts continental evi- 
dence for the extension of ice sheets over North 
America and Eurasia’®, but, if correct, will serve 
as a crucial benchmark for improved model- 
ling of glacial ice-sheet volumes and heights. 
Finally, the authors’ findings should also lead 
to improved forecasts for the fate of extant ice 
caps under global warming. m 
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Sperm protein 
finds its mate 


Knowledge of the sperm-specific protein that is required for the attachment of 
sperm to eggs during fertilization in mammals has led to the identification of the 
protein’s receptor on the egg’s plasma membrane. SEE ARTICLE P.483 


PAUL M. WASSARMAN 


iology is full of surprises. Such is the 
B case with the findings of Bianchi et al." 

reported on page 483 of this issue. The 
authors report the end of a decade-long search 
for a partner for the sperm protein Izumol, 
which is responsible for sperm-egg adhesion 
during fertilization. The elusive mate is 
a member of the folate-receptor protein 
family, and is located on the plasma membrane 
of unfertilized eggs. Because of its essential role 
in fertilization, the researchers propose that 
the folate receptor, currently known as Folr4, 
should be renamed Juno, after the Roman 
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goddess of fertility and marriage. 
Fertilization defines the process by whicha 
sperm and an egg combine to form a single- 
celled zygote that, in time, gives rise toa new 
individual. In 2005, researchers reported’ that 
a protein that spans the sperm’s plasma mem- 
brane, called Izumol (named after a Japanese 
marriage shrine), is essential for fertilization 
in mammals. Male mice that lack Izumol are 
infertile — their sperm seem normal, but are 
unable to fuse with eggs. Subsequently, it was 
found that Izumol is part of a multiprotein 
family whose members form large complexes 
on sperm that may be essential for sperm-egg 
fusion’. However, the identity of the receptor 


to which Izumol binds remained a mystery, 
owing largely to inherent difficulties in work- 
ing with small numbers of mammalian eggs, as 
well as to the transient nature of extracellular 
interactions between proteins tethered to the 
membrane. 

Bianchi et al. identified the mammalian 
egg receptor for Izumol using a method 
previously developed in their laboratory for 
detecting weak interactions between membrane- 
tethered proteins’. The authors used a region of 
the Izumol protein — the part that extends into 
the extracellular space — as a binding probe, 
and found that a member of the folate-receptor 
family, Folr4, is the sole receptor for Izumol 
on unfertilized eggs. Furthermore, they con- 
firmed previous suggestions that, unlike other 
Folr proteins, Folr4 is unable to bind folic acid>® 
(a form of folate). Bianchi and colleagues found 
that Folr4, which they rename Juno, is mem- 
brane-tethered and is present on the eggs of 
mice, opossums, pigs and humans. 

The researchers examined three Izumo and 
three folate-receptor family members and 
found that only Izumol and Juno could inter- 
act with one another. Exposure of eggs to an 
antibody that blocks Juno prevented both the 
binding of Izumo] to Juno and fertilization 
in vitro. Furthermore, female mice that lacked 
Juno were infertile, and eggs recovered from 
these mice could not be fertilized by sperm 
in vitro. In this situation, the authors found that 
sperm were unable to adhere to and fuse with 
the egg’s membrane, but instead accumulated 
in the perivitelline space that lies between the 
egg’s membrane and its thick extracellular coat, 
the zona pellucida. Even when the research- 
ers removed the zona pellucida, exposing the 
plasma membrane, eggs that lacked Juno could 
not be fertilized by sperm in vitro and fewer 
sperm bound to the egg’s membrane than to 
the membranes of control eggs. Collectively, 
these results strongly suggest that the forma- 
tion of an Izumol-Juno adhesion complex is 
essential for fertilization in mammals (Fig. 1a). 

These observations beg the question: is 
the interaction between Izumol and Juno 
the only requirement for sperm-egg fusion? 
Some recent evidence’ suggests that, although 
Izumol induces surface interactions such as 
adhesion between cell membranes, it does 
not promote membrane fusion. Indeed, when 
Bianchi and co-workers induced expression 
of Juno or Izumo] in non-fusing cell types, 
they found no evidence for cell fusion. They 
concluded that the Izumol-Juno interaction 
is a necessary event for adhesion between 
sperm and eggs, but that cell fusion requires 
other membrane proteins — perhaps similar 
to those required for fusion in other cell types, 
such as EFF-1 (ref. 8) and myomaker’. The 
authors go on to propose that local clustering 
of Juno on the egg’s membrane, possibly organ- 
ized by another membrane-spanning protein 
such as Cd9 (ref. 10), may occur to increase the 
strength of sperm binding. 
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Figure 1 | An adhesion complex mediates binding of sperm to eggs. a, During fertilization, a single 
sperm binds to the egg’s membrane, which lies under an extracellular coat, the zona pellucida. Bianchi et al.’ 
report that the protein Izumol, which is tethered to the membrane of sperm, forms an adhesion complex 
with its receptor protein, Juno, which spans the egg’s membrane. Fertilization does not take place in the 
absence of this complex. b, After fertilization, Juno is lost from the egg’s membrane, exiting in extracellular 
vesicles, thereby preventing the binding and fusion of additional sperm (known as the block to polyspermy). 


Bianchi et al. got another surprise when they 
looked for Juno on the fertilized egg’s mem- 
brane. Thirty to forty minutes after sperm-egg 
fusion, Juno was barely detectable, having made 
its exit in extracellular vesicles. The authors 
suggest that loss of Juno after fertilization may 
account for the fact that fusion is limited (by 
an unknown mechanism) to one sperm per 
egg, a phenomenon known as the block to 
polyspermy”' (Fig. 1b). In keeping with this 
suggestion, Juno did not exit the egg’s mem- 
brane when sperm were injected directly into 
eggs (known as intracytoplasmic sperm injec- 
tion), or when eggs were activated with ethanol. 
Neither of these treatments results in a block to 
polyspermy” and this could be due to retention 
of Juno on the fertilized egg’s membrane, per- 
mitting binding and fusion of additional sperm. 

As with other advances in biology, this 
discovery raises questions and creates possi- 
bilities. For example, whether cases of infertil- 
ity in women can be ascribed specifically to 
mutations in the gene encoding Juno remains 
to be evaluated. However, Izumol has proved 
to bea suitable candidate for the development 
of a contraceptive vaccine 3 and it is likely that 
Juno and Izumol-Juno complexes will provide 
additional targets for contraception. Detailed 
knowledge of the three-dimensional struc- 
ture of the Izumol-Juno complex could cer- 
tainly facilitate the design of small molecules 
that specifically inhibit formation of the 
adhesion complex and, consequently, prevent 
fertilization. 

Another logical next step is to identify 
proteins that interact with Izumol-Juno 
complexes and stimulate sperm-egg fusion. 
Further application of the techniques 
described by Bianchi et al. could aid the iden- 
tification of such proteins. In a similar vein, 
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the mechanism by which Juno is ejected from 
the egg membrane after sperm-egg fusion is 
not yet understood. Removal of Juno in extra- 
cellular vesicles may depend on the cortical 
reaction”, a part of the process of preventing 
polyspermy that involves fusion of vesicles 
known as cortical granules with the egg’s mem- 
brane after fertilization. Much remains to be 
done, but the identification of an egg-based 
partner for Izumol represents a significant 
step forwards for our understanding of the 
fertilization process. m 
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Guidelines for investigating causality of 
Sequence variants in human disease 
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M. B. Gerstein?°!, D. B. Goldstein’, J. N. Hirschhorn”?, S. M. Leal**, L. A. Pennacchio*»°, J. A. Stamatoyannopoulos”, 

S.R. Sunyaev”®?, D. Valle*°, B. F. Voight”, W. Winckler?+ & C. Gunter!*+ 


The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence 
variants from the many potentially functional variants present in any human genome are urgently needed. Without 
rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of 
genomic research findings into the clinical diagnostic setting and hinder biological understanding of disease. Here we 
discuss the key challenges of assessing sequence variants in human disease, integrating both gene-level and variant- 
level support for causality. We propose guidelines for summarizing confidence in variant pathogenicity and highlight 


several areas that require further resource development. 


igh-throughput sequencing approaches can generate detailed 

catalogues of genetic variation in both disease patients and the 

general population. However, for these technologies to have the 
greatest medical impact we must be able to separate genuine disease- 
causing or disease-associated genetic variants reliably from the broader 
background of variants present in all human genomes that are rare, po- 
tentially functional, but not actually pathogenic (Box 1) for the disease 
or phenotype under investigation. 

Many, but unfortunately not all, variants that have been causally 
associated with rare and common genetic disorders represent robust and 
correct conclusions. False assignments of pathogenicity can have severe 
consequences for patients, resulting in incorrect prognostic, therapeutic 
or reproductive advice, and for the research enterprise, resulting in mis- 
allocation of resources for basic and therapeutic research. Unfortunately, 
although the vast majority of genes reported as causally linked to mono- 
genic diseases are true positives, false assignments of causality at the variant 
level are a substantial issue. One recent analysis of 406 published severe 
disease mutations observed in 104 newly sequenced individuals reported 
that 122 (27%) of these were either common polymorphisms or lacked 
direct evidence for pathogenicity’. Other studies have identified numer- 
ous alleged severe-disease-causing variants in the genomes of population 
controls**. In other cases, well-powered follow-up studies of high-profile 
reported mutations have cast serious doubts on initial reports assigning 


disease causality to sequence variants**, but the vast majority of false- 
positive findings probably remain undetected. As the volume of patient 
sequencing data increases it is critical that candidate variants are sub- 
jected to rigorous evaluation to prevent further misannotation of the path- 
ogenicity of variants in public databases. 

This paper describes the challenges in reliably investigating the role 
of sequence variants in human disease, and approaches to evaluate the 
evidence supporting variant causality. It represents the conclusions of a 
working group of experts in genomic research, analysis and clinical diag- 
nostic sequencing convened by the US National Human Genome Research 
Institute. 

We focus on the application of genome-scale approaches to investi- 
gating rare germline variants, defined here as variants with a minor allele 
frequency of <1%. Our recommendations are most relevant for variants 
with relatively large effects on disease risk. Our intended scope encom- 
passes the vast majority of variants implicated in severe monogenic dis- 
eases as well as rare, large-effect risk variants in complex disease®, but 
excludes the common, small-effect variants typically identified by gen- 
ome-wide association studies of complex traits’. 

Unambiguous assignment of disease causality for sequence variants is 
often impossible, particularly for the very low-frequency variants under- 
lying many cases of rare, severe diseases. Consequently, we refer in this 
manuscript to the concept of implicating a gene or sequence variant: that 
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BOX | 
Terms used to describe sequence 
variants 


Lack of clarity in the terms used to describe sequence variants is a 
major source of confusion in human genetics. We have adopted the 
following definitions for terms used throughout this manuscript. 

Pathogenic: contributes mechanistically to disease, but is not 
necessarily fully penetrant (i.e., may not be sufficient in isolation to 
cause disease). 

Implicated: possesses evidence consistent with a pathogenic role, 
with a defined level of confidence. 

Associated: significantly enriched in disease cases compared to 
matched controls. 

Damaging: alters the normal levels or biochemical function of a 
gene or gene product. 

Deleterious: reduces the reproductive fitness of carriers, and would 
thus be targeted by purifying natural selection. 


is, the process of integrating and assessing the evidence supporting a role 
for that gene or variant in pathogenesis. We emphasize the primacy of 
strong genetic support for causation for any new gene, which may then 
be supplemented and extended with ancillary support from functional 
and informatic studies. 

Our recommendations centre on five key areas: study design; gene- 
level implication; variant-level implication; publication and databases; 
and implications for clinical diagnosis. Core guidelines for researchers 
are summarized in Box 2. We also provide a list of factors to consider in 
the analyses of candidate variants in presumed monogenic diseases (Sup- 
plementary Information) and a list of resources for assessing pathogeni- 
city (Supplementary Table 1). 


Study design 

Investigators seeking to identify pathogenic variants should select tech- 
nological and analytical approaches based on the most likely genetic archi- 
tecture of the disease of interest. Rare, high-penetrance protein-coding 
variants can be cost-effectively captured by exome sequencing, which is 
rapidly becoming the first-line approach for presumed monogenic disorders’. 
Cytogenomic arrays and genotyping of linkage panels remain useful ap- 
proaches for the identification of copy number variation and for identify- 
ing co-segregating haplotypes within large Mendelian (especially dominant) 
disease families, respectively. Optimal approaches to discovering rare path- 
ogenic variants in complex diseases remain unclear: exome sequencing’, 
deep and low-coverage whole-genome sequencing” and/or next-generation 
genotyping arrays with enhanced coverage of protein-coding variants are 
all being applied in research settings. As the cost of sequencing declines, 
we expect that deep whole-genome sequencing will soon become the 
technology of choice for investigating all genetic architectures. 

In selecting technological and analytical approaches for a new study, 
investigators should consider formal power calculations" incorporating 
predicted distributions of allele frequencies and effect sizes for patho- 
genic variants, genetic and phenotypic heterogeneity of available cohorts, 
population frequency of the disease, and available sample sizes. Although 
parameter values may be uncertain, current knowledge of the genetics of 
the disease and similar traits can be used to constrain likely ranges. In 
particular, for many diseases there is overwhelming evidence that both 
locus and allelic heterogeneity is high, such as in autism, epilepsy and schizo- 
phrenia. A study design that assumes low locus and allelic heterogeneity 
would be certain to fail for these conditions, and this fact would be re- 
vealed by even casual evaluations of power for reasonable genetic models. 
Gene discovery for conditions with low locus heterogeneity and suffi- 
ciently high-penetrance mutations is occasionally possible by sequencing 
a single family’*; however, most gene-discovery applications will require sub- 
stantially larger sample sizes: multiple unrelated families for rare monogenic 
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conditions, and thousands to tens of thousands of patients and controls 
for complex disorders’. 

To assemble large sample sizes will typically require pooling of pa- 
tient cohorts by multiple investigators. Although such consortium ap- 
proaches are desirable, investigators should be mindful of systematic 
differences among cohorts stemming from technical biases, population 
stratification, and genetic and phenotype heterogeneity. For studies of 
complex traits, many quality-control methods developed for genome- 
wide association studies of common variants will also apply to rare variant 
studies'*, but DNA sequencing data face a different and typically more 
challenging set of quality considerations, particularly when data sets are 
combined for meta-analysis. In addition, new methods may need to be 
developed to address population stratification of rare variants’’, which 
show stronger geographic clustering than common variants’*®; to min- 
imize the impact of stratification, controls should be matched closely to 
the ancestry of patient samples. 

For presumed monogenic diseases, the availability of multiple fam- 
ilies with very similar clinical phenotypes substantially increases power 
for gene discovery. For cases in which there is a single affected proband 
and no family history, investigators should consider sequencing the unaf- 
fected parents of the probands, permitting efficient discovery of de novo 
mutations and compound heterozygous genotypes. Investigators should 
begin by examining sequence variation in genes known to be associated 
with that phenotype, and assessing sequence coverage of the coding se- 
quences and splice junctions for these genes before exploring the pos- 
sibility of new candidate genes in the affected individuals. 


Gene-level implication 


To implicate a variant as pathogenic requires that the DNA sequence 
affected by that variant has a role in the disease process. For genes not 
previously reported as causal, investigators must simultaneously dem- 
onstrate evidence for a role of a candidate gene and one or more variants 
disrupting it. Even if the candidate gene has been previously implicated 
in the same or a similar disease phenotype, the overall support from 
published sources should be carefully assessed and reported. Multiple 
classes of evidence may potentially contribute to pathogenic inferences 
at the level of both gene and variant, and include genetic, informatic and 
experimental data (Table 1 and Supplementary Information). However, 
in keeping with the history of the field of human genetics, we emphasize 
the critical primacy of robust statistical genetic support for the implica- 
tion of new genes, which may then be supplemented with ancillary 
experimental or informatic evidence supporting a mechanistic role for 
the gene in the disease in question. 

Historically, gene-level implication in monogenic diseases has relied 
first on identifying a narrow set of candidate genes through genetic data 
such as linkage analyses or experimental data on biochemical function, 
and then identifying rare, probably damaging variants (altering the nor- 
mal levels or biochemical function of a gene or gene product) in one of the 
candidate genes in multiple affected patients. The increasing availability 
of large-scale sequencing data now allows genome-scale approaches to 
gene discovery, in which the distribution of rare, predicted gene-disrupting 
variants in patients is systematically compared to population controls or 
well-validated null models to identify genes with an excess of potentially 
pathogenic variants for clinical and functional follow-up. 

It is worth emphasizing that the whole-genome sequence data sets 
are in some ways more prone to misinterpretation than earlier analyses 
because of the sheer wealth of candidate causal mutations in any human 
genome, many of which may provide a compelling story about how the 
variant may influence the trait; a problem that has been referred to as the 
‘narrative potential’ of human genomes”. To avoid such biases the evi- 
dence supporting any candidate gene should be contrasted wherever pos- 
sible with the evidence observed at other presumably non-disease-related 
genes (for example, by ranking the gene among all others and reporting 
the probability ofa similar or greater contrast being observed by chance). 
Formal genome-wide statistical approaches to monogenic-disease gene 
discovery will require considerable methods development, but general 


©2014 Macmillan Publishers Limited. All rights reserved 


PERSPECTIVE | RESEARCH | 


BOX 2 
Guidelines for implicating sequence variants in human disease 


General guidelines 

e@ Provide complete positive and negative evidence associated with the gene or variant implication, not just the results that are consistent with 
pathogenicity. 

e |Inall cases in which itis possible, place genetic, informatic and experimental results within a quantitative framework: determine the probability 
of observing this result by chance with a randomly selected variant or gene. 


rigorously as your own data. 


Assessment of evidence for candidate disease genes 


implicated in multiple unrelated individuals. 


or well-calibrated null models. 
Assessment of evidence for candidate pathogenic variants 


matched as closely as possible to patients in terms of ancestry. 
implicate a variant as playing a causal role in disease. 


methods as independent lines of evidence for implication. 


animal models of gene function. 


Publications and reporting 


databases. 


conveyed as such, along with appropriate supporting evidence. 


is high. 


e@ Take advantage of public data sets of genomic variation, functional genomic data and model-organism phenotypes. 
e@ Donot regard prior reports of gene or variant implication as definitive: to the degree that supporting data are available, reassess them as 


e@ Describe and assess clearly the available evidence supporting prior reports of gene or variant implication. 


e@ In presumed monogenic-disease cases, evaluate genes previously implicated in similar phenotypes before exploring potential new genes. 
e@ Reporta new gene as confidently implicated only when variants in the same gene and similar clinical presentations have been confidently 


e Inallcasesin which itis possible, apply statistical methods to compare the distribution of variants in patients with large matched control cohorts 


e@ Determine and reportthe formal statistical evidence for segregation or association of each variant, and its frequency in large control populations 
e@ Recognize that strong evidence that a variant is deleterious (in an evolutionary sense) and/or damaging (to gene function) is not sufficient to 
e@ Predict variant deleteriousness with comparative genomic approaches, but avoid considering any single method as definitive or multiple 

e@ Validate experimentally the predicted damaging impact of candidate variants using assays of patient-derived tissue or well-established cell or 


e@ Avoid assuming that implicated variants are fully penetrant, or completely explanatory in any specific disease case. 


e Assess and report objectively the overall strength and cohesiveness of the evidence supporting pathogenicity for all variants listed in a publication. 
e@ |nall cases in which it is possible, ensure that the level of confidence of pathogenicity and supporting evidence are propagated in variant 


e@ Deposit genotype and phenotype data for both controls and disease patients, and for resultant analyses demonstrating associations, in publicly 
accessible databases, to the maximum degree permissible under study-specific participant consent and ethical approval. 


e lf returning results for clinical use, highlight strong, actionable findings but also ensure that uncertain or ambiguous findings are clearly 


e Provide clear cautions regarding decision-making based on variants with limited evidence when the potential for use in medical interventions 


guidelines for establishing the significance of variation can be considered 
here. As we discuss below, these considerations apply equally to assessing 
the significance of rare variation in common disease studies. 

Our paramount recommendation is that for genome-wide analyses of 
rare variants for both Mendelian and complex disorders, formal calcula- 
tion of statistical significance should be used to evaluate the strength of 
evidence of a set of findings, following the well-established standard of 
maintaining overall type I (false discovery) error rates below 5%. For 
example, investigators should not simply assume that the presence of two 
or more independently occurring de novo mutations in the same gene 
within a sequenced cohort is definitive evidence of a causal role for that 
gene’*; such a threshold results in ever increasing numbers of false 
positives as the number of sequenced cases increases. To illustrate this, 
consider the recent situation of four exome sequencing studies, involving 
a total of 945 families with a child affected by autism”, which together 
observed four independent de novo missense mutations in TTN. Never- 
theless, the investigators did not consider TTN to have a causal role in 
autism, and appropriately so: using a statistical model similar to prev- 
iously published approaches®**** that accounts for gene size (ITN has 
the largest coding sequence of any gene in the genome), mutation rate, 
number of trios and distribution of exome coverage, 1.96 de novo TIN 
missense or loss-of-function mutations are predicted by chance, which is 
not significantly different (P = 0.14) from the four observed. 

We consider a single gene as the fundamental unit for monogenic 
disease gene testing, for all disease models; a disease caused by de novo 


mutations or a disease caused by inherited dominant or recessive var- 
iants. An appropriate framework for detecting pathogenic variants will 
evaluate all of the variation in a gene compared to a well-calibrated null 
model specific for the hypothesis being considered (for example, de novo, 
dominant, recessive). 

Although the field has well-established guidelines for declaring signi- 
ficance using linkage data”, it is now important to consider a conserva- 
tive baseline threshold for declaring significance purely from sequencing 
data of cases, in the absence of other genealogical information. In this 
scenario, as the gene is the fundamental unit of analysis, and there is no 
additional data to constrain the search space for genes, a typical study 
might perform tests on 21,000 protein-coding genes and 9,000 long non- 
coding RNA genes”*”’. A conservative genome-wide significance thresh- 
old corresponding to this testing strategy is a Bonferroni-corrected P value 
of 1.7 X 10° © (that is, 0.05 out of 30,000). Importantly, if several different 
schemes are used to define ‘qualifying mutations’ in such analyses, it is 
necessary to make further statistical adjustments for each of the different 
sets of rules that are used. 

Formal null models can be specified based on the disease model of 
interest. As mentioned above, the null model for the case of the de novo 
mutation analysis should consider confounding variables such as sam- 
ple size, gene size and mutation rate (which may vary by orders of mag- 
nitude among genes). We note that such null models have power even 
for extremely rare conditions and small sample sizes: the first exome 
sequencing study of Kabuki syndrome”* initially identified 7 de novo 
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Table 1 | Classes of evidence relevant to the implication of sequence variants in disease 


Evidence level Evidence class 


Examples 


Gene level Genetic 


Experimental 


Variant level Genetic 


Gene burden: the affected gene shows statistical excess of rare (or de novo) probably damaging variants segregating 
in cases compared to control cohorts or null models. 

Protein interactions: the gene product interacts with proteins previously implicated (genetically or biochemically) 
in the disease of interest. 

Biochemical function: the gene product performs a biochemical function shared with other known genes in the 
disease of interest, or consistent with the phenotype. 

Expression: the gene is expressed in tissues relevant to the disease of interest and/or is altered in expression in 
patients who have the disease. 

Gene disruption: the gene and/or gene product function is demonstrably altered in patients carrying candidate 
mutations. 

Model systems: non-human animal or cell-culture models with a similarly disrupted copy of the affected gene show 
a phenotype consistent with human disease state. 

Rescue: the cellular phenotype in patient-derived cells or engineered equivalents can be rescued by addition of the 
wild-type gene product. 

Association: the variant is significantly enriched in cases compared to controls. 

Segregation: the variant is co-inherited with disease status within affected families and additional co-segregating 
pathogenic variants are unlikely or have been excluded. 

Population frequency: the variant is found at a low frequency, consistent with the proposed inheritance model and 
disease prevalence, in large population cohorts with similar ancestry to patients. 


Informatic 


Conservation: the site of the variant displays evolutionary conservation consistent with deleterious effects of 


sequence changes at that location. 
Predicted effect on function: variant is found at the location within the protein predicted to cause functional 
disruption (for example, enzyme active site, protein-binding region). 


Experimental 


Gene disruption: the variant significantly alters levels, splicing or normal biochemical function of the product of the 


affected gene. This is shown either in patient cells or a well-validated in vitro model system. 

Phenotype recapitulation: introduction of the variant, or an engineered gene product carrying the variant, into a cell 
line or animal model results in a phenotype that is consistent with the disease and that is unlikely to arise from 
disruption of genes selected at random. 

Rescue: the cellular phenotype in patient-derived cells, model organisms, or engineered equivalents can be rescued 
by addition of wild-type gene product or specific knockdown of the variant allele. 


loss-of-function variants in the MLL2 gene in just 10 sequenced patients, 
a finding that is extremely unlikely by chance under the background 
mutation model described above (P = 1.9 X 10 78) and that provided 
compelling evidence implicating this gene as causal. 

Formal methods for assessing the significance of observations in rare 
disease cohorts can also be used to assess, for example, the aggregate evi- 
dence for segregation of rare variants in a particular gene when consider- 
ing inherited variation, building on previously published examples”. In 
this case, the null model should be a population genetic model, for in- 
stance, the site frequency spectrum (SFS) of variation constructed from a 
well-matched control cohort. The null model of the SFS for a given gene 
should consider both the mutation rate and selective constraint acting on 
that gene. When evaluating data from a single case, the probability that 
the variation in a gene is from the null model can by estimated by first 
identifying the most pathogenic class of variant present in that gene in 
that case, and then by calculating the probability of sampling a variant of 
the same class of pathogenicity from the null SFS. Similarly, when the 
recessive disease model applies, the most pathogenic class of variant on 
the paternal and maternal haplotypes is identified, and then the prob- 
ability of sampling both variants from the null SFS is calculated. This 
testing framework for inherited variants is easily scaled to include mul- 
tiple disease cases. Ideally, to avoid false positives, the control cohort upon 
which the SFS is based would be sequenced and analysed in a manner 
identical to the disease cases. 

Such methods may not yet be applicable to every rare disease scenario, 
and will require work to extend to more exotic inheritance modes such 
as parental imprinting or obligate compound heterozygosity”. Although 
formal methods are established to perform these tests rigorously, research- 
ers should at the very least evaluate and report the level of background 
variation in an implicated gene in population cohorts, taking advantage of 
public resources such as the Exome Variant Server (http://evs.gs.washington. 
edu/EVS/) when implicating a new gene in pathogenesis. Furthermore, 
the analysis of at least some number of controls, sequenced and analysed 
in a manner identical to cases, can be critical for avoiding the systematic 
false positives that remain commonplace in exome and genome sequencing. 

Just as for genome-wide association studies of common variants", 
replication of newly implicated disease genes in independent families 
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or population cohorts is critical supporting evidence, and in most cases 
essential for a novel gene to be regarded as convincingly implicated in 
disease. For the rarest disorders additional cases for independent rep- 
lication may be unavailable and it may be impossible to make a com- 
pelling statistical case for implication from human genetic data alone. In 
these cases, gene implication must be based on an integrated analysis of 
genetic, informatic and experimental evidence. 

Provided that it is carried out in a statistically rigorous fashion, ancil- 
lary information can be used to boost power for gene discovery. For example, 
many genome-wide sequencing-based studies treat all protein-altering 
variants equally while ignoring all other classes of variants. More elegant 
schemes aimed at prioritizing based on predicted pathogenicity may boost 
power for such studies. Another approach is to stratify gene candidates by 
their expression in a tissue appropriate to the disease under analysis. For 
example, a recent study combined variant- and gene-level stratification 
to show that the de novo mutation rate in congenital heart disease was 
similar in cases versus controls, but the odds ratio rose to 7.5 when focus- 
ing on de novo mutations predicted to be damaging and to occur in genes 
expressed in the developing heart”’. 

Experimental evidence that can contribute to support for gene impli- 
cation falls into three broad categories, listed here in order of increasing 
strength. First, experimental data can be used to demonstrate that the 
normal function of the gene is consistent with the known biology of the 
disease process, for example by showing that the gene is expressed in 
tissues relevant to the disease*, or that its protein product co-localizes 
with, or physically interacts with, the products of other genes previously 
implicated in the disease*’. Second, investigators can demonstrate that a 
gene product is functionally disrupted by mutations in patients with the 
disease of interest, as discussed in the variant-level evidence section below. 
Lastly, disruption of the candidate gene in a model organism can be shown 
to result in a phenotype that recapitulates the relevant pathology in humans 
and is unlikely to occur with disruption of genes selected at random****. 

A complete description of the experimental methods relevant to gene 
implication falls outside the scope of this manuscript. However, we note 
that the value of experimental approaches depends critically on the appro- 
priateness of the model system to the human disorder that is being inves- 
tigated. Whether cell line or animal models will be most appropriate will 
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depend on context: simple cultured cell models may be inappropriate for 
developmental disorders affecting complex organ systems. For similar 
reasons, animal models are not well suited for analysis of human-specific 
aspects of biology. 

As noted above, it is also important to consider the specificity of gene- 
level support; that is, the probability of observing a similar result if the 
experiment or analysis was performed with a randomly selected gene. 
For example, if a new candidate gene is implicated in non-syndromic 
short stature in humans, observing that its orthologue is associated with 
small body size in knockout mice is relatively uninformative given that a 
similar phenotype occurs in over 30% of all knockout mouse strains*®. 
Similarly, reports that the product of a gene potentially implicated in a 
metabolic disorder is localized to mitochondria should also consider that 
these are complex organelles with many highly expressed genes. Wherever 
possible, investigators should use informatics approaches to assess such 
metrics in publicly available high-throughput data sets of functional geno- 
mic and model organism phenotype data*’. Although it remains challen- 
ging to quantify the statistical confidence of functional observations, those 
that can be convincingly demonstrated to represent very low-probability 
events under an appropriate null hypothesis provide more compelling sup- 
port for implicating a given variant. Even in situations in which a formal 
statistical framework is not possible we emphasize that researchers must 
assess functional data rigorously and clearly report their limitations. 


Variant-level implication 


Genetic evidence implicating a variant must be assessed within the con- 
text of the considerable background of rare genetic variants in humans. 
Even healthy individuals carry many rare protein-disrupting variants”, 
and about half carry at least one de novo protein-altering mutation”. 
Such variants are therefore not typically sufficient proof of causality when 
observed in a disease case, even if present in well-established disease genes: 
genes differ markedly in their tolerance to variation” and rare variants 
predicted to be damaging in disease-associated genes are often observed 
even in population controls*’. 

In both established and newly implicated disease genes, investigators 
should formally assess and report the statistical support for association. 
Family-based studies should also assess co-segregation of candidate var- 
iants with disease status. Given that a separate, unobserved pathogenic 
mutation may lie on the same haplotype as the candidate variant, segrega- 
tion analysis alone cannot definitively implicate a specific variant as path- 
ogenic, but (at least under an assumption of complete penetrance) lack of 
segregation can exclude non-pathogenic variants from consideration. 

Informatic and/or experimental evidence for variant implication can 
be used to assess whether a variant is likely to be deleterious in an evo- 
lutionary sense (Box 1), which primarily comes from in silico annotation 
and comparative genomics”, and predict that a variant is damaging in 
terms of biological function, arising both from computational predic- 
tions and experimental assays. Both categories of evidence can support 
implication, but they do not necessarily demonstrate a causal role for the 
variant with respect to the trait under study. Again, we stress that hun- 
dreds to thousands of coding variants in an individual will typically be 
labelled as potentially deleterious or damaging, or both; the strength of 
the resulting evidence for pathogenicity must be considered in the con- 
text of this background level of variation. 

Measures of evolutionary sequence conservation are widely used indica- 
tors of deleteriousness for both protein-coding and non-coding variation”. 
Such approaches have demonstrated value in prioritizing candidate 
variants*’**; however, their predictive power is limited by both statistical 
and biological factors. Many deleterious variants do not show a strong 
conservation signature, particularly if the gene has been subject to rapid 
evolution in the human or primate lineage, or if there have been compen- 
satory substitutions in other regions of the protein in ancestral species*’. 
Conversely, strong conservation can be maintained at sites subject to 
even relatively weak selective pressure, at which variants may have only 
small effects on disease risk. The power of these methods also depends on 
the accuracy and phylogenetic scope of the underlying sequence alignments. 
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These limitations should be taken into account when using predictions of 
deleteriousness as evidence for implication. Even though it is worthwhile 
to use multiple prediction algorithms, investigators should avoid treat- 
ing these as though they represent strong or independent lines of evidence 
for pathogenicity. 

Although some classes of variation, such as truncating or splice-site- 
disrupting variants in the middle of a protein-coding gene, are more likely 
to be damaging than others, such variants are also enriched for sequen- 
cing and annotation errors and may be rescued by alternative RNA splic- 
ing, other variants, or local sequence context*'. These possibilities should 
be assessed, and if possible the predicted damaging effect should be con- 
firmed experimentally. 

Experimental approaches to investigating the impact of a sequence 
variant on gene function, or cell or organism phenotype, can also have a 
role in demonstrating that a variant is damaging to gene function and in 
identifying the molecular mechanisms underlying a variant’s effect on 
disease risk. However, great care must be taken to select appropriate ex- 
perimental methods, which will depend on the class of variant, biolog- 
ical context (for example, tissue type), access to samples and reagents, 
desired throughput, time and cost. When a gene has already been con- 
fidently implicated in disease, and it is known what class of variant is 
causal (for instance, loss or gain of function as represented by a specific 
assay), then an experiment that places a variant of unknown significance 
into such a functional class can be particularly informative. 

Evidence derived directly from patient tissue or cells can often be 
stronger than that from model systems, particularly (for loss-of-function 
variants) if the molecular defect can be rescued by complementation in a 
cellular assay. Replicating disease-relevant phenotypes in a heterologous 
cell line engineered to carry the proposed causal variant can help to rule 
out effects of a patient’s genetic background on disease outcome. Weaker 
but still valuable support can be provided by assays performed in model 
organisms, more artificial cell culture systems, and non-cellular models 
such as construct-based assays of altered protein-protein interactions or 
transcript splicing. Models are most valuable if they directly mimic the 
predicted functional impact of the candidate variants: for example, knock- 
out mice are better models of recessive loss of function than of dominant 
missense mutations in a candidate gene. In the case of compound het- 
erozygous recessive inheritance—particularly if the proposed mode of 
action depends on an interaction between allelic variants, such as in TAR 
(thrombocytopenia with absent radius) syndrome**—it will be necessary 
to develop cellular assays that incorporate and assess multiple variants 
simultaneously. 

The impact of variation in non-protein-coding regions of the gnome— 
such as splicing and transcriptional enhancers—remains particularly chal- 
lenging to interpret, but we note that systematic experimental approaches 
have begun to both highlight the regions of the human genome most likely 
to have a role in gene regulation”, and to dissect the potential impact 
of variation within them’’. However, given the challenges of predicting 
impact for non-coding variants, it remains critical to determine whether 
the purported pathogenic variant does in fact produce the expected effect 
on expression or splicing of the affected gene, either by demonstrating an 
unusual expression level in the patient or by in vitro experimentation (such 
as minigene constructs). 

We caution against the assumption that convincingly implicated 
variants, even in presumed monogenic disorders, are necessarily fully 
penetrant (that is, sufficient in isolation to cause disease). In fact the 
penetrance of most reported disease-associated mutations has not been 
accurately assessed with current data owing to the biases associated with 
sample ascertainment. Indeed, the prevalence of reported severe-disease- 
causing mutations in population controls”* suggests that incomplete pene- 
trance, false assignment of pathogenicity, or wider-than-appreciated ranges 
of expressivity area substantially more common feature of reported Men- 
delian disease mutations than generally appreciated. Accurate estimates 
of penetrance require characterization of reported mutations in large, 
well-phenotyped population cohorts**°*. Further large-scale studies of 
this kind should be a priority for the field. 
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Wealso note the underappreciated importance of calibrating the accu- 
racy of functional assays by large-scale testing of variants confidently estab- 
lished to be non-pathogenic (for example, common missense polymorphisms 
in the gene of interest). Such experiments establish a baseline estimate for 
the impact of well-tolerated variants on the assay in question. 


Publication and data sharing 


As noted above, there are many false positives in disease-mutation data- 
bases, stemming largely from erroneous assignment of pathogenicity 
both in clinical diagnostic laboratories and in the primary literature’**’. 
To reduce this burden will require robust, centralized repositories of 
mutation data, incorporating explicit, structured evidence for variant 
pathogenicity and systems for rapid correction of entries. To incentivize 
both research and clinical laboratories to deposit variation data into open 
repositories, and to update evidence for or against implication, is a key 
challenge to be addressed by funding bodies, journals, research consortia, 
clinical organizations and others”’. We are hopeful that such activities can 
be coordinated around the US National Center for Biotechnology Infor- 
mation (NCBI)’s newly launched ClinVar database (https://www.ncbi. 
nlm.nih.gov/clinvar/), which will also interface with existing efforts in 
this space including the LOVD (Leiden Open (source) Variation Database)? 
and other locus-specific databases, OMIM (Online Mendelian Inheritance 
in Man; http://omim.org/) and DECIPHER (Database of Chromosomal 
Imbalance and Phenotype in Humans Using Ensembl Resources)”. 

In some cases—such as diseases that are extremely rare or have high 
degrees of locus heterogeneity—it may be impossible to obtain definitive 
evidence implicating a specific gene or variant with available sample sizes. 
In such cases we acknowledge that the suggestive evidence pointing to a 
gene’s potential implication can nevertheless be valuable in future clinical 
and research investigations, and should not be excluded from publica- 
tions or the public domain. However, it is incumbent on investigators, 
reviewers and journals to be explicit in describing the supporting evid- 
ence and the degree of confidence in causality for each proposed gene 
association and reported variant. 

Finally, we emphasize the value of sharing sequence and phenotype 
data from clinical and research samples to the fullest possible extent. Many 
investigators and research funders consider responsible data sharing to 
be a moral and professional imperative*’. In many cases, particularly for 
extremely rare phenotypes, individual laboratories that are not actively 
recruiting subjects will evaluate only a handful of samples. Sharing of se- 
quence data among testing laboratories has often been restricted, so that 
many potentially pathogenic mutations and associated phenotypes are 
known only to individual laboratories. The availability of genome-wide 
variant calls and detailed clinical phenotype descriptions from such patients 
in centralized repositories—which will require substantial investment both 
in informatic infrastructure and new ethical frameworks—would permit 
more rapid accumulation of evidence for novel genes, and continuous 
reanalysis to refine the classification of potentially implicated variants 
and the genotype-phenotype map of human disease. Models for success- 
ful data sharing efforts in rare disease already exist in the field of copy 
number variation with the DECIPHER database” and the International 
Standards for Cytogenomic Arrays Consortium (https://www.iscacon 
sortium.org/), aided by an increasing number of rare-disease resource 
consortia, and several ambitious efforts to establish clear global standards 
for genomic data sharing are now underway”. 


Added challenges in clinical settings 
Although this summary is focused on research, research findings pro- 
vide the foundation for clinical interpretation. Questionable attributions 
of causality based on weak research evidence can be readily propagated 
through research databases and can be misinterpreted clinically as stron- 
ger than they truly are. Thus, even researchers who do not explicitly pro- 
vide diagnosis to patients should be aware that their published findings 
may be used as support for decisions made in clinical settings. 

Clinical laboratories face similar challenges in assessing variant path- 
ogenicity as do researchers, but with the added pressures of diagnostic 
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urgency and the potentially severe consequences of misdiagnosis. Al- 
though guidelines are available for variant interpretation in a diagnostic 
setting”, analytical frameworks for next-generation sequencing data are 
only beginning to emerge**”’. Responsible application of these technolo- 
gies will require standards for test validation, variant interpretation and 
return of results. 

The results of genetic and genomic testing are increasingly being used 
in medical decision-making, including recommendations for prophy- 
lactic mastectomy, cardiac defibrillator implantation, tumour therapy 
and prenatal diagnosis. These actions are neither generally inappropri- 
ate nor uniformly incorrect; however, the potential for harm due to mis- 
interpretation of variants is substantial. Although physicians must often 
make medical decisions using imperfect or ambiguous data, it is critical 
that healthcare providers be made aware of the varying levels of certainty 
in the evidence for implicating a variant in disease, both through the 
consistent use of variant classification terminologies and descriptions of 
the supporting evidence or lack thereof. 


Conclusions 


High-throughput DNA sequencing technologies provide unprecedent- 
ed opportunities to discover new genes and variants underlying human 
disease, but these discoveries must be rigorously performed and repli- 
cated to prevent the proliferation of false-positive findings. 

Assessment of evidence for variant implication is a two-step process. 
First, the overall evidence for implication of a gene should be considered, 
focusing primarily on the statistical support for implication from genetic 
analyses, potentially supplemented by ancillary data from informatic 
sources and functional studies. Second, a combined assessment of the 
genetic, experimental and informatic support for individual candidate 
variants should be performed. Such assessments should be performed 
even if the genes or variants have been previously reported as confi- 
dently implicated; prior evidence should be continuously re-evaluated 
with newly available information. 

We urge that, whenever possible, investigators assess the results of ge- 
netic, informatic and functional analyses within a quantitative statistical 
framework, such as determining the probability of the observed distri- 
bution of genetic variants in cases and controls under the null hypoth- 
esis, and the a priori power to detect variants of a specified frequency and 
effect size. The specificity of experimental or informatic results provided 
in support of implication should also be assessed whenever possible by 
asking how often a similar result would be obtained by chance among a 
set of random variants or genes. In such analyses investigators should 


BOX 3 
Priorities for research and 
infrastructure development 


e@ Improved public databases of human genetic variants 
incorporating explicit, up-to-date supporting evidence for variant 
implication in disease and audit trails recording changes in 
interpretation. 

e@ Improved incentives, and ethical and logistical solutions, for 
sharing of genetic and phenotypic data from both research and 
clinical diagnostic laboratories. 

e@ Public databases of variant and allele frequency data from large 
sets of population reference samples from a wide range of ancestries. 
e Large-scale genotyping of reported human disease-causing 
variants in large, well-phenotyped population cohorts, reducing biases 
in the assessment of the associated penetrance and phenotypic 

heterogeneity. 

@ Development and benchmarking of standardized, quantitative 
statistical approaches for objectively assigning probability of 
causation to new candidate disease genes and variants. 
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take advantage of the increasing availability of genome-scale sequencing 
and functional data, and help to build these resources by contributing 
their findings to public databases. 

The community should also focus on the ongoing development of re- 
sources in several key areas (Box 3). In particular, major improvements 
in databases of reported pathogenic mutations, including details of the 
evidence supporting pathogenicity, are urgently needed. Large-scale 
experiments to assay previously reported disease-associated mutations 
in additional large, well-phenotyped populations will also be required to 
confirm pathogenicity and provide robust evidence of penetrance and 
expressivity. Finally, extensive work is needed to develop formal statistical 
frameworks for quantifying the strength of the evidence for implication. 

Objective, systematic and quantitative evaluation of the evidence for 
pathogenicity and sharing of these evaluations and data amongst re- 
search and clinical laboratories will maximize the chances that disease- 
causing genetic variants are correctly differentiated from the many rare 
non-pathogenic variants seen in all human genomes. 
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Sea-level and deep-sea-temperature 
variability over the past 5.3 million years 


E. J. Rohling’”, G. L. Foster”, K. M. Grant, G. Marino!, A. P. Roberts!, M. E. Tamisiea* & F. Williams? 


Ice volume (and hence sea level) and deep-sea temperature are key measures of global climate change. Sea level has been 
documented using several independent methods over the past 0.5 million years (Myr). Older periods, however, lack 
such independent validation; all existing records are related to deep-sea oxygen isotope (5'8O) data that are influenced 
by processes unrelated to sea level. For deep-sea temperature, only one continuous high-resolution (Mg/Ca-based) 
record exists, with related sea-level estimates, spanning the past 1.5 Myr. Here we present a novel sea-level recon- 
struction, with associated estimates of deep-sea temperature, which independently validates the previous 0-1.5 Myr 
reconstruction and extends it back to 5.3 Myr ago. We find that deep-sea temperature and sea level generally decreased 
through time, but distinctly out of synchrony, which is remarkable given the importance of ice-albedo feedbacks on the 
radiative forcing of climate. In particular, we observe a large temporal offset during the onset of Plio- Pleistocene ice ages, 
between a marked cooling step at 2.73 Myr ago and the first major glaciation at 2.15 Myr ago. Last, we tentatively infer 
that ice sheets may have grown largest during glacials with more modest reductions in deep-sea temperature. 


To understand better the potential response of ice volume (sea level) 
to global warming, there is a need for continuous, highly resolved and 
well-quantified records of sea-level variations associated with past 
climate fluctuations!“. In addition, such records are critical for under- 
standing the development of major ice-age cycles over the past ~3 Myr 
and of the attendant reorganizations in the coupled climate—ocean sys- 
tem, including extensive biological and biogeochemical perturbations” *. 

Continuous sea-level records with centennial resolution, suitable for 
investigating magnitudes and rates of sea-level change, exist for the past 
0.5 Myr (refs 1, 9-11). For older periods, existing millennially resolved 
sea-level reconstructions include (1) a continuous, model-based decon- 
volution of global deep-sea benthic foraminiferal 3180 data (5'%0,) 
into temperature and ice-volume changes back to 35 Myr ago”; (2) a 
scaling of global deep-sea 5'*O, using New Zealand sequence strati- 
graphic data for 3.4-2.3 Myr ago””; (3) a direct scaling of another deep- 
sea 8'°O,, compilation for the past 7 Myr (ref. 13); and (4) a deep-sea 
5/80, record (corrected for deep-sea temperature, Ty,) over the past 
1.5 Myr from Chatham rise, in the southwest Pacific Ocean, which is argued 
to be representative of global deep water™. A further, lower-resolution, 
Tas-corrected deep-sea 5/80, record exists for the North Atlantic'’, al- 
though questions exist concerning carbonate chemistry influences on 
the epibenthic species analysed in that record’’, and about site-specific 
issues regarding North Atlantic Deep Water property variations versus 
potential water-mass changes due to Antarctic Bottom Water penetration. 

All existing methods rely on deep-sea 80, and are, therefore, not 
independent of each other. In addition, there is limited temporal over- 
lap between these reconstructions and, importantly, methodological 
uncertainties are typically much larger in sea-level reconstructions for 
periods before 0.5 Myr ago. Hence, it is necessary to develop indepen- 
dent sea-level reconstructions to identify mutually consistent patterns. 
This in turn will enable fundamental questions to be addressed con- 
cerning the timing and development of Northern Hemisphere glaci- 
ation, sea-level variability during past warm periods with greenhouse gas 
concentrations similar to those of today, and the long-term relationship 
between ice volume, temperatures and greenhouse gas concentrations. 


To advance the debate, we here present a new and independent sea- 
level reconstruction that spans the past 5.3 Myr. 


Location for new sea-level reconstruction 


The Red Sea would be a promising location for developing an extended 
sea-level reconstruction beyond 0.5 Myr ago’"'*'"'”. Red Sea sea-level 
reconstructions for the past 0.5 Myr rely on hydraulic control of water 
exchange through a shallow and narrow connection with the open ocean 
(the Bab-el-Mandab Strait)'°’”. The method is independent of deep- 
sea 8'8O,, and yields ‘Relative sea level at Bab-el-Mandab’ (RSLg.\) recon- 
structions with a lo uncertainty of ~6 m (refs 10, 17). Unfortunately, 
no high-quality Red Sea sediment cores exist that allow extension of 
RSLgem beyond 0.55 Myr ago. We therefore shift focus to the Mediter- 
ranean Sea, which is another evaporative marginal sea with limited con- 
nection to the open ocean. 

Discerning a sea-level signal in Mediterranean records of carbonate 
microfossil 5'°O is more complex than in the Red Sea because (1) the 
larger strait profile at Gibraltar, relative to Bab-el-Mandab, causes a 
lower signal-to-noise ratio, with Mediterranean glacial-interglacial 5'°O 
amplitudes of 2.5-3%0 compared to Red Sea amplitudes of 5.5-6%0; 
and (2) the Mediterranean hydrological cycle is more complicated than 
in the Red Sea, with major rivers that integrate information from a large 
catchment area with influences from both temperate climate condi- 
tions and the African monsoon!™"!"!”-!°, However, the Mediterranean 
provides uninterrupted sediment records dating back to the end of the 
Messinian salinity crisis at 5.33 Myr ago, from tectonically uplifted mar- 
ine sediments and long deep-sea sediment cores”””’. Eastern Mediter- 
ranean planktonic foraminiferal 3180 records (85'°O,) have recently been 
synthesized into a 5.3-Myr ‘Mediterranean stack’ with a millennially 
resolved, orbitally tuned chronology”. It is particularly beneficial for 
sea-level reconstruction that orbital tuning of the Mediterranean record 
(in contrast to deep-sea 3'8O, records) makes no assumptions about 
the relationship between insolation and ice volume; instead, it employs 
a timing relationship between insolation and African monsoon intens- 
ity (using sedimentary cycles)””. The Mediterranean chronology is so 
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well established that it underpins global geochronology throughout the 
time interval considered here’. Therefore, the eastern Mediterranean 
5'°O, stack” is an excellent resource for developing a long sea-level 
record using a Mediterranean version of the method that was devel- 
oped for the Red Sea (see Methods). This has only recently become 
possible owing to increased quantitative understanding of the relation- 
ship between hydrological processes and 8'*O changes in and around 
the Mediterranean""*?**. 


Converting Mediterranean 5'%0 to sea level 

We quantify relative sea-level changes at Gibraltar (RSLi,) using values 
of eastern Mediterranean 5'°O,, after removal of ‘sapropel’ intervals of 
major surface freshwater dilution. These intervals are associated with 
periods of sea-floor anoxia and are typically marked by dark olive to 
black organic-rich sediments (bounded by pale organic-poor deposits), 
with light surface-water 5'°O anomalies, elevated sedimentary Ba/Al 
ratios and an absence of benthic microfossils'**!”* ”. We exclude sapro- 
pel intervals from RSL, on the basis of a combination of oO, anom- 
aly detection and visual evidence (Extended Data Fig. 1). 

Our RSLgip reconstruction method is explained in detail in Methods 
(with code in Supplementary Information). The method exploits the 
influences of sea-level and buoyancy-loss changes on a hydraulic con- 
trol model”? for the Strait of Gibraltar'*’°*', which has been indepen- 
dently validated (within uncertainties) by other analytical and numerical 
solutions**~**. The strait model is connected to a basin-representation 
box model, which includes summer and winter mixed-layer separation, 
and oxygen isotope fractionation calculations'*”’. The basin model is iden- 
tical to that detailed in ref. 19, except that we here omit the so-called 
‘monsoon box’, which is relevant only to sapropels (which are excluded 
here). Following previous Mediterranean habitat identifications’, two 


Sea level (m) 


Sea level (m) 


5'°O,-to-RSL ‘converters’ are presented: one for (upper) Mediterranean 
Intermediate Water dweller Neogloboquadrina pachyderma (dextral); 
and one for summer mixed-layer dweller Globigerinoides ruber (white). 
These ‘converters’ (notably that for G. ruber) are here used to determine 
changes in RSL, from non-sapropelic eastern Mediterranean 30, 
data (Extended data Fig. 2). Propagated uncertainties in individual RSLei, 
estimates are up to ~20 m (1o; see Fig. 1 and Methods), but uncertainty 
in the mean signal is smaller owing to autocorrelation in the record. A 
probabilistic assessment that combines RSLgj, uncertainties with chro- 
nological uncertainties yields a ‘probability maximum record with a 95% 
probability interval of +6.3 m (see Fig. 2 and Methods). 

The RSLgjp method relies on two underlying assumptions. The first 
is that the Strait of Gibraltar in the past has exerted hydraulic control 
on water exchange in a similar manner to today”. Large-scale tectonic 
movement would be detected as a breakdown, or major drift, in the sea- 
level solutions. Given that the Strait of Gibraltar probably formed dur- 
ing a terminal Miocene event that may have been followed by intense 
crustal adjustments and erosion**”, it is best to consider the earliest por- 
tion of RSLeip with caution. In the interval younger than 3.3 Myr, confi- 
dence increases owing to comparison with other sea-level reconstructions 
(see below), and because of indications that exchange flow through the 
Strait of Gibraltar adopted a modern-type configuration from about 
3.8 Myr ago** (Methods). The second assumption concerns our 5'°O- 
to-RSL ‘converter’. It relies on Late Pleistocene parameter relationships 
with generous uncertainty ranges (Methods), and assumes that past 
relationships remained within these uncertainty ranges. Again, we expect 
deviations from this assumption to cause a breakdown, or major drift, in 
the solutions. Validation between RSLgj, and independent methods sug- 
gests that this assumption is valid, especially in the past 1.5 Myr and 
probably throughout (at least) the past 3.3 Myr (Figs 1, 2). Nonetheless, 


Age (kyr) 


Figure 1 | RSLgip compared with RSLgem. RSLgem has been previously 
validated against a wide range of independent sea-level benchmarks''™"'. 

a, RSLoip for eastern Mediterranean sediment core LC21 (red) with lo error 
bars (orange), and RSLgem (black) with 26 error bars’''”° as well as the 
probabilistically assessed 95% probability envelope (shading)''. Individual 
Mediterranean data comply with the 2¢ envelope for the Red Sea data. b, RSLein 
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for an eastern Mediterranean stack” (red) with 1o error bars (orange), and 
RSLgem (black) with 2¢ error bars’"’°. Gaps in the RSLgjp records result from 
removal of sapropel(-like) events, but some residual influences of freshwater 
influxes on Mediterranean 5'%O (ref. 19) may remain immediately adjacent to 
these intervals. An apparent ‘undetected’ sapropel-like event (yellow bar) is also 
indicated (Methods). Note that a and b cover different age ranges. 
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Figure 2 | RSLgi, for an eastern Mediterranean 30, stack, compared with 
other sea-level estimates. a, Interval from 0 to 2 Myr ago. b, Interval from 2 to 
5.4 Myr ago. RSLeip (based on the record of ref. 22) is presented using all non- 
sapropelic data points (red dots), along with the median (red line) from 
probabilistic analysis with its 95% probability interval (light orange shading). 
In sapropel intervals, marked by absence of (red) data points, linear 
interpolation of the 95% probability interval is shown for aesthetic reasons only 
(Methods). Other sea-level estimates are from Mg/Ca-based T,,-corrected 
deep-sea 8180, for the southwest Pacific!* (blue; see Methods for its 


given the potential caveats to our method with respect to long-term 
tectonic and climate-regime changes, we emphasize the need for new 
continuous and highly resolved records from independent methods to 
strengthen mutual validations before 1.5 Myr ago, and especially before 
3.3 Myr ago (Methods). 

Wealso make an initial assessment of land movement due to glacio- 
hydro-isostatic adjustment at the Camarinal sill, which is the shallowest 
and hydraulically limiting passage on the Atlantic side of the Gibraltar 
narrows’’ (Methods). We find that the glacial-interglacial amplitude 
of RSL, underestimates global mean (eustatic) sea level (hereafter ESL), 
and that the offset scales proportionally with the variation in land ice 
(where ESL = 1.23 RSLgip; see Methods), so that ESL amplitude varia- 
tions will be larger than those of RSLip. This scaling suggests Pliocene 
isostatic adjustments in the region that are compatible with previous 
estimates*”. Regardless, in our comparisons with estimates from other 
methods (Figs 1 and 2), we plot RSLcip instead of ESL, because issues about 
extension of our isostatic assessment back in time, and to periods with 
sea level considerably above the present level, remain to be constrained. 
We note that the international ‘Pliocene Maximum Sea Level’ (PLIOMAX) 
project currently estimates ESL at 12-32 m for the period 3.3-2.9 Myr 
ago (M. E. Raymo, personal communication), similar to the Pliocene range 
of 9-31 m used in ref. 4. These values agree well with RSL, fluctua- 
tions during that period (Fig. 2). PLIOMAX estimates do not specify 
the nature and magnitude of temporal variability, so our RSLeip record 
provides the first quantitative view of the secular evolution of sea level 
during the Pliocene that is independent of deep-sea 8'°O,. 


RSLgip Validation 

Where overlap exists, comparison of RSLgi, with independent sea- 
level reconstructions reveals good agreement (Figs 1 and 2). For younger 
intervals (<0.5 Myr ago), we now assess RSLgip values derived from 


Age (kyr) 


probabilistic presentation here) and for the North Atlantic’® (black; 3-point 
moving average); a model-based deconvolution of deep-sea 5180, (ref. 12) 
(green); and conversion of deep-sea 5'*O, with support from New Zealand 
sequence stratigraphic data’ (purple). The last was vertically positioned to 
agree with the 12-32 m ESL estimate for the period 2.9-3.3 Myr ago 
(PLIOMAX data presented at PALSEA2 meeting; M. E. Raymo, personal 
communication; dark blue dashed box). Three apparent ‘undetected’ 
sapropel-like intervals in RSLeip are indicated (yellow bars; Methods). 


different 5 Oy data sets on their respective timescales. RSLoi, from 
oO, in aslein Mediterranean sediment core LC21—which has a close- 
ly related chronology to RSLgem''—agrees well with RSLgem over the 
last glacial cycle through intervals of both deglaciation (145-125 kyr 
ago) and glacial inception (120-65 kyr ago), and has the additional bene- 
fit of resolving changes between 26 and 14 kyr ago, where RSLgem is 
poorly resolved due to aplanktonic conditions in the Red Sea (Fig. 1a). 
Using an eastern Mediterranean em stack’, and allowing for dif- 
ferent age models, RSLgit also compares well with RSLgem over the past 
0.5 Myr (refs 1, 40; Fig. 1b). Agreement between the two RSL records 
from different ocean margins reflects the fact that both areas have gen- 
erally comparable glacio-hydro-isostatic responses (Methods). The prin- 
cipal disagreements lie close to sapropel(-like) intervals of freshwater 
dilution in the Mediterranean, which tend to bias RSLgi, towards higher 
values. This suggests that residual effects of these events may occasion- 
ally remain, owing to imperfect detection/removal of sapropel(-like) inter- 
vals (Methods). Future work can use same-sample multi-proxy approaches 
to improve this situation, and reinstate the ‘monsoon box’ in the model’”® 
to try and resolve RSLgj, through sapropel intervals, but these are multi- 
year efforts beyond the scope of the present study (Methods). 

Next we compare the full RSLj, record with available sea-level recon- 
structions for older (>0.5 Myr ago) intervals (Fig. 2). We observe strong 
agreement between RSLgip and sea-level estimates from Mg/Ca T4.- 
corrected deep-sea 3/80, over the past 1.5 Myr (ref. 14; Fig. 2a), which 
independently supports the intensification of glacials across the Mid- 
Pleistocene transition that was first inferred from the southwest Pacific 
Tas-corrected deep-sea 5'°O, record'*. However, before ~1.5 Myr ago, 
highstand values of RSLi, seem higher than those in other studies'*»’. 
Apart from RSLgjp, the only records with continuity across the past 3 Myr 
are an Atlantic Ta,-corrected deep-sea 3180, record!’, and the model- 
based estimates of ref. 12. Relative to RSLgit, both suggest a lower mean 
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sea level between ~ 1.5 and 3.2 Myr ago (Fig. 2). If we assume that this 
difference arises from bias in RSLgip, then it might reflect a more open 
strait before ~1.5 Myr ago, possibly due to an uplift event in the Strait 
of Gibraltar at ~ 1.5 Myr ago. However, such an event would also affect 
the sea-level sensitivity (amplitude response) of Mediterranean 5'%O. 
This is difficult to reconcile with the observation that RSLgi, amplitude 
variations agree well with those in a record based on scaling of a New 
Zealand sequence stratigraphic record between 3.4 and 2.3 Myr ago””, 
and also with those in the Atlantic record (allowing for resolution and 
chronological differences)’* (Fig. 2b). Hence, we suggest that any change 
in RSLgip at ~1.5 Myr ago is more likely to reflect a “baseline shift’ in 
Mediterranean climate conditions from a warm/moist state to a warm/ 
arid state, rather than a tectonic step at the Strait of Gibraltar. Alter- 
natively, we might assume that the bias is not due to RSLgip, given that 
there is good agreement between RSLgj, and PLIOMAX estimates for 
the 2.9-3.3 Myr interval (Fig. 2b). That would suggest that problems 
may instead lie within the Atlantic T4,-corrected deep-sea 5'°O, record'® 
(as also suggested before; see, for example, ref. 16), and within the model- 
based estimates of ref. 12. Further independent validation is needed 
before this can be settled, but—regardless—major signal-amplitude sim- 
ilarity in all independent observational methods over the past 3.3 Myr 
challenges the substantially different sea-level inferences from model- 
based deconvolution of deep-sea 3180, (ref. 12; Fig. 2a, b). 


Deep-water temperature variability 

Next, we use RSLgip to derive information about global Tg, changes. 
Owing to uncertainty in the long-term RSLgip:ESL scaling, we consider 
two scenarios, one of which relies on direct use of RSLgj, and the other 


onan ESL estimate of 1.23 X RSLoin (Figs 3, 4). We translate these into 
estimates of seawater 5'°O (8'°O,,) changes using a ratio of (0.009 + 
0.001)%o m™' (refs 14, 41, 42; Figs 3b and 4b). Subtraction of 5180,, 
from a global deep-sea 5'*O, stack (Figs 3a and 4a), following slight 
adjustment of the chronology of this stack to that of the RSLgj, record 
(Methods; Extended Data Table 1), yields residuals that approximate 
global Ty, changes in a 0.25%o °C ratio’. Thus, we estimate global Ty, 
changes over the past 5.3 Myr, with propagated (20) uncertainties of 
about +0.6 °C (Figs 3c and 4c). Our estimates agree well with indepen- 
dent Mg/Ca-based Ty, estimates for a site that is thought to approxi- 
mate global mean deep-water conditions’ (Figs 3c and 4c). The observed 
mutual consistency over glacial—interglacial cycles and longer timescales 
between this Mg/Ca-based T,, record and associated sea-level reconstruction”, 
and our RSLgip and associated Tg, reconstruction, suggests that over the 
past 1.6 Myr (1) seawater Mg/Ca ratios did not change significantly and 
(2) Strait of Gibraltar morphology and the Mediterranean ‘baseline 
climate state’ experienced no major changes. In addition, the lower- 
resolution Atlantic Ty, record’’ also validates a major Tas drop in our 
reconstruction at ~2.73 Myr ago (see below; Fig. 3c). 


Timing and magnitude of glaciations 


Several key observations can be drawn from our analysis. First, we extend 
to 5.3 Myr ago the conclusion, previously drawn for the past 1.5 Myr 
(ref. 14), that global deep-sea 5'°O (see, for example, refs 13, 43) does 
not adequately capture ice-volume history because its two main com- 
ponents (Ty, and ice-volume effects) underwent distinctly different tem- 
poral developments (Fig. 3). Second, regarding the onset of Quaternary 
glacial cycles, we find that a distinct deep-sea cooling step at 2.73 Myr 
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Figure 3 | Deep-sea temperature and 5'°O,, components of deep-sea 5'°O,,. 
a, Deep-sea 5'*O, (ref. 43) on original chronology (magenta) and chronology 
tuned to that of ref. 22 (black). b, Component of sea-level-based ocean 5'°Oy, 
variations (in black based on RSL¢jp and in green for our ESL approximation, 
both using 0.009% m~'), compared with 5'%0,, for the southwest Pacific! 
(blue; 3-point moving average). All variations are assessed relative to present. 
c, Residual 5'°O component that is ascribed to Tg, changes using 0.25%0 °C’. 
Gaps in b and c relate to sapropel(-like) intervals. For details see Methods. Also 
shown in c are 3-point moving averages of Mg/Ca-based Tg, records for the 
North Atlantic’’ (orange) and the southwest Pacific (blue)'*. The southwest 
Pacific record is shown in original Mg/Ca units (as made available), but is 
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exactly scaled to Ty, variations on the other axes as described in ref. 14. Error 
bars: a, mean uncertainties reported for deep-sea 8/80, in the global stack”; 
b, propagated uncertainty in the sea-level (RSL¢ip)-based 5480,, change 
component, based on the 95% probability envelope to the median (Methods, 
and Fig, 2) and a +0.001%om™' uncertainty in the conversion to 8!%O,,; and 
c, propagated uncertainties from a and b. Yellow bars as in Fig. 2. A major deep- 
sea cooling (green dashed line), and the first ‘deep’ glaciation (sea-level 
lowering below —70 m; blue dashed line) are indicated. Red lines are 
straightforward polynomial fits shown only to highlight general long-term 
trends (all based directly on RSLgi,). Age on x axis is based on the chronology of 
ref. 22. A magnification of the past 1.5 Myr is shown in Fig. 4. 
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Figure 4 | Expanded version of Fig. 3 for the past 1.5 Myr only. a-c, As in 
Fig. 3. Error bars: a, mean uncertainties reported for deep-sea 5'°O, in the 
global stack"; b, propagated uncertainty in the sea-level (RSLgi,)-based 50, 


ago substantially pre-dated the first major glaciation in our record, by 
0.58 Myr (Fig. 3). Note that bias due to sapropel intervals is towards 
high RSLgip values, and that comparisons between well-defined RSLeip 
lowstand values are robust relative to this bias. This strengthens con- 
fidence in our identification of the first major lowstand, especially 
because data for the event at 2.15 Myr ago appear to be ~60 m (3a) 
lower than for any preceding lowstand (Fig. 2). 

Current concepts for the onset of Northern Hemisphere glaciation 
rely strongly on deep-sea 5'°O, data, and suggest a shift to stronger 
glacials at ~2.7-2.5 Myr ago (refs 43, 44; Fig. 3a). Our new data chal- 
lenge this perspective, because the change at 2.73 Myr ago appears to 
relate to cooling, whereas the first ‘deep’ (sea level below — 70 m) glacial 
occurred considerably later, at 2.15 Myr ago. Pronounced cooling at 
~2.73 Myr ago is supported not only by the independent Atlantic deep- 
sea Mg/Ca record’* (Fig. 3c), but also by an alkenone-based North 
Atlantic surface temperature record’®, and the culmination ofa long-term 
equatorial Pacific cooling trend*. It is consistent with ample evidence 
for widespread ocean and climate change at ~2.7 Myr ago (Extended 
Data Table 2), including glaciation on Greenland and Scandinavia (see 
synthesis in ref. 8). Apparent temporal association of this cooling with 
a decline in atmospheric CO, levels (see, for example, ref. 46) suggests a 
causal link. For instance, stratification in both the North Pacific and the 
Southern Ocean intensified at ~2.7 Myr ago in association with deep- 
sea cooling, leading to increased ocean carbon storage’. These are key 
regions of deep and intermediate water formation (particularly the South- 
ern Ocean), and changes in their overturning circulation may, therefore, 
strongly influence widespread oceanic carbon storage and, hence, atmo- 
spheric CO, levels (see, for example, ref. 47). Our inferred first ‘deep’ 
glacial at 2.15 Myr ago also falls within a window of major climatic and 
oceanic changes (Extended Data Table 2), including a major cooling 
in tropical sea surface temperatures”, but its nature requires further val- 
idation (for example, through extension of the record of ref. 14). 

Terrestrial indications that a major North American ice sheet devel- 
oped to low latitudes (39° N) date to ~2.4 Myr ago, while the earliest 
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change component, based on the 95% probability envelope to the median 
(Methods, and Fig. 2) and a +0.001%o m”* uncertainty in the conversion to 
880,; and c, propagated uncertainties from a and b. 


record of significant North American-sourced ice-rafted debris suggests 
that ice sheets extended to marine margins at 2.64 Myr ago (refs 7, 8). 
RSLgip has amplitudes of 50-70 m at that time (Fig. 2b), which imply 
that early ice sheets had relatively low profiles relative to their large 
inferred areas. Such ‘low-slung’ ice sheets may have existed because basal 
friction was lower during early glacial cycles than during more recent 
ones*?”°, 

Finally, we infer that similar amplitudes among three of the last 
four glacial maxima in the global deep-sea 3/80, stack? (Figs 3a and 4a) 
may obscure increasing ice-volume contributions (Figs 3b and 4b) that 
are compensated by decreasing deep-sea temperature contributions 
(Figs 3c and 4c). Two independent methods (this Article and ref. 14) sug- 
gest that the Last Glacial Maximum was one of the most intense glacia- 
tions in terms of ice volume, but that its deep-sea temperatures may 
have been relatively ‘mild’ by glacial standards (Figs 3 and 4). This ap- 
parent difference in Tj, between glacials is smaller in the southwest 
Pacific record than in our reconstruction (Fig. 4); possibly, it was most 
notable in the Atlantic Ocean, which dominates the 5'°O, stack that 
we used to estimate T4,. The pattern also is less evident in RSLgem, but 
RSLgem is known to be deficient through the LGM'®”’ (Fig. 1). At this 
stage, therefore, the inferred pattern is suggestive only; it requires vali- 
dation from both improved RSL, reconstructions based on continu- 
ous multi-proxy core records, and additional deep-sea benthic Mg/Ca 
records. If validated, then it may reflect the importance of atmospheric 
moisture supply (and reduced atmospheric moisture capacity with de- 
creasing temperature) in determining total ice accumulation. 


METHODS SUMMARY 


In Methods, we explain (1) our Mediterranean relative sea level at Gibraltar 
(RSLgip) calculations, including elimination of so-called sapropelic intervals with 
freshwater dilution, a discussion of long-term tectonic effects, an assessment of 
glacio-hydro-isostatic influences and an outline of scope for future refinements; 
(2) our probabilistic assessment of the sea-level record of ref. 14; and (3) our use of 
RSLgip with the deep-sea 5'%O stack of ref. 43 to determine changes in deep-sea 
temperature (Tys). A full copy of our PTC MathCad 13 worksheet, which was used 
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to calculate the RSLgj, relationship with eastern Mediterranean 3/80, as measured 
on the planktonic foraminiferal species Globigerinoides ruber (white) and Neoglo- 
boquadrina pachyderma (dextral), is available in Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Juno is the egg Izumo receptor and is 
essential for mammalian fertilization 


Enrica Bianchi’, Brendan Doe’, David Goulding’ & Gavin J. Wright' 


Fertilization occurs when sperm and egg recognize each other and fuse to form a new, genetically distinct organism. The 
molecular basis of sperm-egg recognition is unknown, but is likely to require interactions between receptor proteins 
displayed on their surface. Izumol is an essential sperm cell-surface protein, but its receptor on the egg has not been 
described. Here we identify folate receptor 4 (Folr4) as the receptor for Izumol on the mouse egg, and propose to rename 
it Juno. We show that the Izumol-Juno interaction is conserved within several mammalian species, including humans. 
Female mice lacking Juno are infertile and Juno-deficient eggs do not fuse with normal sperm. Rapid shedding of Juno 
from the oolemma after fertilization suggests a mechanism for the membrane block to polyspermy, ensuring eggs normally 
fuse with just a single sperm. Our discovery of an essential receptor pair at the nexus of conception provides opportunities 
for the rational development of new fertility treatments and contraceptives. 


Fertilization is the culminating event in sexual reproduction and requires 
the fusion of haploid sperm and egg to create a new, genetically distinct, 
diploid organism. Sperm acquire the ability to fertilize the egg within the 
female reproductive tract by exposing previously concealed receptor 
proteins onto their surface following the acrosome reaction’. Once fer- 
tilized, both the oolemma and zona pellucida are biochemically altered, 
making the egg unreceptive to additional sperm and thereby reducing 
the chances of creating nonviable polyploid embryos’. Several receptor 
proteins have been implicated in the recognition and/or fusion process’, 
but just two significantly affect fertility in vivo: IZumol on sperm’, and 
CD9 on eggs*”. Izumo] (named after a Japanese marriage shrine) is redis- 
tributed to the surface of capacitated sperm’, and Izumo1-deficient male— 
but not female—mice are infertile because sperm lacking Izumol cannot 
fuse with eggs*. Recombinant Izumol1 binds both wild-type and CD9- 
deficient eggs, suggesting that Izumol interacts with an egg receptor 
other than CD9°. Glycophosphatidylinositol (GPI)-anchored recep- 
tors on the egg are essential for fertilization because removing them 
either enzymatically”® or genetically” renders eggs infertile. Despite these 
advances, the molecular basis of gamete recognition in mammals is 
unknown, in part, this is due to the scarcity of eggs, the challenges in 
solubilising membrane-embedded proteins, and the often transient 
nature of their extracellular interactions’”. To address these challenges, 
we have developed techniques to identify low-affinity extracellular inter- 
actions and apply them here to investigate sperm—egg recognition in 
mammals. We now describe the Izumol egg receptor as folate receptor 
4 (Folr4), a GPI-anchored protein expressed on the egg surface that is 
essential for female fertility. Because of its role in fertilization and inabil- 
ity to bind folate, we have renamed this protein Juno’ after the Roman 
goddess of fertility and marriage. 


Juno is the Izumol receptor on oocytes 

To identify an egg receptor for Izumol, we expressed the entire ecto- 
domain of mouse Izumo] in mammalian cells for use as a binding probe. 
With the expectation that interactions with its extracellular egg receptor 
would be weak, we oligomerized the Izumol ectodomain to increase 
binding avidity by using a peptide sequence from a cartilage protein 
that forms pentamers’* (Extended Data Fig. 1). The avid Izumol probe, 


but not a control protein, bound the oolemma of mouse oocytes (Fig. 1a). 
To determine the molecular identity of the egg binding partner for Izumol, 
we used a mouse oocyte complementary DNA library in an iterative 
expression cloning approach (Extended Data Fig. 2). In each round of 
selection, an increasing fraction of transfected cells bound Izumo] until 
three independently selected plasmid clones conferred Izumo] binding 
(Fig. 1b). All three clones contained the same cDNA, encoding the folate 
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Figure 1 | Juno is the GPI-anchored oocyte surface receptor for Izumol. 

a, An avid recombinant Izumol protein, but not a control, binds the oolemma. 
b, Izumol binds HEK293 cells transfected with a Juno cDNA (clone B2), but 
not untransfected controls. c, Juno is highly expressed on the oolemma of 
unfertilized eggs. d, Preincubation of eggs with an anti-Juno, but not an 
anti-CD55 control antibody, blocked binding of recombinant Izumo] protein. 
e, f, Cell-surface Juno was lost after treatment with PIPLC on Juno-transfected 
HEK2393 cells (e) or eggs (f). Images are single optical sections of unfertilized 
mouse metaphase II eggs; scale bars, 20 um in a, ¢, d, f} and 10 pm in b. 


1Cell Surface Signalling Laboratory, Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK. 2Mouse Production Team, Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK. 
3Electron and Advanced Light Microscopy Suite, Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK. 
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receptor 4 (Folr4) gene. Folr4 is one of three folate receptor paralogues 
in mouse whose main role is thought to involve folate uptake. Folr4 
is expressed on CD4*CD25" regulatory T cells in mice! and is being 
tested as an anti-tumour therapy'*’’ as well as a mediator of responses 
to dietary folate’*"*, but no functional role on oocytes has been reported. 
Using recombinant proteins, we showed that, unlike Folr1 and Folr2, 
Folr4 was unable to bind to folate (Extended Data Fig. 3a), which was 
expected given differences in amino acids known to be critical for folate 
binding”’”° (Extended Data Fig. 3b). On the basis of these and subse- 
quent findings, we renamed the gene encoding this protein ‘Juno’. Using 
an anti-Juno monoclonal antibody, we showed that Juno expression 
matched the binding pattern of the recombinant Izumo probe on ovu- 
lated oocytes (Fig. 1c). Preincubating oocytes with the anti-Juno antibody 
prevented all detectable binding of the Izumo] probe, demonstrating 
that Juno is the sole Izumo] receptor on oocytes (Fig. 1d). Similar to other 
folate receptor paralogues, the protein sequence of Juno suggested the 
presence of a carboxy-terminal GPI-anchor site. Given the known impor- 
tance of oolemmal GPI-linked proteins in fertilization’®"', we tested 
whether Juno was tethered to the membrane by a GPI anchor. A large 
fraction of cell-surface Juno staining was lost after phosphoinositide 
phospholipase C (PIPLC) treatment of either HEK293 cells transfected 
with the Juno cDNA (Fig. le) or oocytes (Fig. 1f), demonstrating that Juno 
was GPI-anchored, consistent with previous results”. Taken together, 
these data identify Juno as the Izumo] receptor expressed on oocytes. 


Izumol1-Juno interaction is conserved 


To show that Izumol and Juno interacted directly, and to quantify the 
biophysical parameters of the interaction, we expressed and purified 
the entire ectodomain of Juno and measured its binding to Izumo] using 
surface plasmon resonance (SPR) (Fig. 2a). Like other extracellular inter- 
actions measured using this technique, the interaction was shown to 
be highly transient with a Kp of ~12 1M (Extended Data Fig. 4). Juno 
contains a single globular domain’, whereas Izumo1 contains both 
an immunoglobulin superfamily domain and a unique amino terminus 
termed the Izumo domain”. Consistent with the recent finding that the 
Izumo domain is sufficient for egg binding’, we expressed the individual 
domains of Izumo] and showed, using an assay designed to detect direct 
low-affinity extracellular protein interactions called avidity-based extra- 
cellular interaction screen (AVEXIS)”*, that it is the Izumo domain that 
contains both the OBF13 epitope and Juno binding site (Extended Data 
Fig. 5). Both Izumol and Juno belong to small paralogous gene fami- 
lies which, in mouse, contain four and three members, respectively. We 
expressed the entire ectodomains of Izumol1, 3 and 4 (the Izumo2 para- 
logue did not express) and used the AVEXIS assay to screen systemat- 
ically for all possible pairwise interactions with the three Folr paralogues. 
We observed that only mouse Izumol and Juno could interact (Fig. 2b). 
Clearly identifiable Izumo1 and Juno orthologues exist in all sequenced 
mammalian genomes, including marsupials. To determine whether the 
interaction was conserved within mammals, we expressed the entire 
ectodomains of both Izumol and Juno orthologues from humans, pig 
(Sus scrofa) and opossum (Monodelphis domestica) and assessed bind- 
ing using the AVEXIS assay. Clear binding between the orthologues 
was observed, demonstrating that the interaction is conserved within 
mammals (Fig. 2c). 


Juno is essential for fertilization 


To assess the role of Juno in fertilization, we first added an anti-Juno 
monoclonal antibody that blocks the Izumo-Juno interaction (Fig. 1d) 
into in vitro fertilization (IVF) assays. The anti-Juno antibody potently 
prevented fertilization (Fig. 3a), with no detectable fertilization events 
even when used at concentrations as lowas 0.1 ug ml’ (Extended Data 
Fig. 6). Using similar approaches, previous research in this area has 
identified promising candidates for sperm-egg recognition that, when 
genetically deleted, have been shown to be dispensable for fertilization’. 
To determine whether Juno was essential for fertilization, we created 
Juno-deficient mice using embryonic stem cells disrupted at the Folr4 
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Figure 2 | The Izumo1-Juno interaction is direct, transient and conserved 
across mammals. a, Biophysical analysis of the Izumol1-Juno interaction using 
SPR. Serial dilutions of purified, soluble Juno were injected over immobilised 
Izumol, and kinetic parameters derived from a 1:1 Langmuir binding model 
(red lines). b, Binding specificity within the paralogues of the mouse Izumo and 
Folr families: only Juno (Folr4) bound Izumol.c, The Izumo1-Juno interaction 
is conserved across mammals. AVEXIS was used for binding analysis in b 
and c using recombinant Folr/Juno proteins as preys and Izumo proteins as 
baits; control bait in c was CD4. All bar charts show mean + s.e.m.; n = 3. 


locus using a gene trap allele: Folr4’”""@“0™“*" (Extended Data Fig. 7). 
Both Juno ‘~ male and female mice developed indistinguishably from 
wild-type controls, showing normal rates of growth and morphology 
and were overtly healthy. Whereas Juno ‘~ male and Juno*’ female 
mice were fertile, by contrast, Juno’ ~ females failed to produce any 
litters during three months of continuous mating with wild-type males 
of proven fertility (Fig. 3b). Female Juno ’~ mice exhibited natural 
mating behaviours, as assessed by vaginal plug formation and the pre- 
sence of motile sperm in the reproductive tract when paired with fertile 
wild-type males. Juno ‘~ females responded to hormone treatment by 
ovulating morphologically normal eggs at numbers that did not sig- 
nificantly differ from wild-type. Juno ’~ eggs recovered at embryonic 
day 0.5 after superovulation and natural mating were not fertilized and 
had more sperm within their perivitelline space compared to wild-type 
eggs, demonstrating that the zona pellucida of Juno ‘~ eggs could be 
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Figure 3 | Juno is essential for female fertility. a, An anti-Juno monoclonal 
antibody potently blocked in vitro fertilization; anti-Izumol is shown for 
comparison. b, Female Juno ’ ~ mice are infertile. Continuous mating of 
Juno ‘~ female mice to proven wild-type males for 3 months did not result in 
any pups. c, Greater numbers of sperm were observed in the perivitelline space 
(PVS) of eggs from superovulated Juno-deficient (—/—) mice relative to 
wild-type (+/+).d,e, Zona-free Juno ’~ eggs do not fuse with wild-type sperm 
in vitro. Sperm were added to Juno-deficient (—/—) and wild-type (+/+) eggs 
and fused sperm quantified after two hours (d), or pronuclei after six hours 
(e). All bar charts show mean = s.e.m.; numbers in parentheses are total 
number of eggs (a, c, d, e) or number of mating pairs (b). 


penetrated by wild-type sperm in vivo, which then either did not bind 
or fuse with the oolemma (Fig. 3c). To investigate this further, we removed 
the zona pellucida from Juno ’~ eggs and found that they could not be 
fertilized with wild-type sperm in IVF assays, as assessed by counting 
the number of fused sperm and formed pronuclei per egg (Fig. 3d, e). 
This is consistent with a role for the Izumol-Juno interaction in sperm- 
egg adhesion or fusion. To distinguish between these possibilities, nor- 
mally non-fusing HEK293 cells were separately transfected with plasmids 
encoding either Juno or Izumo1 cDNAs and mixed. No syncytia were 
observed in the mixed cultures, indicating that the interaction is not 
sufficient for cell fusion (Extended Data Fig. 8a). Furthermore, acrosome- 
reacted normal sperm did not bind as efficiently to zona-free Juno- 
deficient eggs as wild-type controls (Extended Data Fig. 8b). Together, 
these findings indicate that the Izumo1-Juno interaction is a necessary 
adhesion event between sperm and egg that is required for fertilization. 


Juno is rapidly shed after fertilization 


After fertilization, oocytes become largely refractory to further sperm 
fusion events to prevent the creation of nonviable polyploid embryos 
due to polyspermy. This is achieved through both a relatively slow- 
acting (> 1h) hardening of the zona pellucida caused by the action of 
enzymes released from cortical granules after egg activation, and a faster- 
acting block involving biochemical changes in the oolemmal membrane. 
In broadcast-spawning frogs and aquatic invertebrates, the membrane 
block to polyspermy is very fast (a few seconds to minutes) and is achieved 
by electrically depolarizing the oolemma™. In mammals, the membrane 
block occurs over a longer timeframe (30-45 min), and despite being 
first described 60 years ago, its mechanistic basis remains a mystery’. 
Given the essential role of Juno in fertilization, we explored the role it 
might play in establishing the membrane block to polyspermy. Inter- 
estingly, cell-surface Juno was rapidly lost after fertilization, being only 
weakly detectable in zona-intact fertilized eggs at telophase II and unde- 
tectable at the pronuclear stage (Fig. 4a). Similarly, Juno was barely 
detectable on anaphase-II-stage zona-free oocytes, approximately 30 
to 40 min after fertilization (Extended Data Fig. 9). Folate receptors are 
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Figure 4 | Juno is rapidly shed from the oolemma of normally fertilized 
eggs, but not ICSI-fertilized or parthenogenetically activated eggs. 

a, Cell-surface Juno rapidly becomes undetectable after fertilization. Juno 
(green) is expressed on ovulated metaphase II eggs, but is barely evident at 
telophase II and undetectable on pronuclear-stage zygotes. Arrow and asterisk 
indicate sites of first and second polar body extrusion, respectively; 
chromosomes are not within the plane of focus. b, Immunogold electron 
microscopy localized Juno primarily to the oolemma in unfertilized, ovulated 
eggs, but was redistributed to vesicles within the perivitelline space within 1h 
after fertilization. ZP, zona pellucida. c, Eggs fertilized by ICSI or 
parthenogenetically activated retain oolemmal Juno staining until at least the 
pronuclear stage, as shown. Nuclei/pronuclei were visualized with DAPI (blue). 
Images show representative eggs from three independent experiments. Scale 
bars represent 20 jim (a, c) and 1 tum (b). 


known to take up extracellular dietary folate by endocytosis” and we 
therefore used immunogold electron microscopy to determine the sub- 
cellular localization of Juno following fertilization. Surprisingly, we observed 
that Juno was not internalised after fertilization but present in extra- 
cellular vesicles, presumably derived from the microvillus-rich oolemma 
that undergoes significant architectural changes upon fertilization’® 
(Fig. 4b). Some methods of fertilization, such as intracytoplasmic sperm 
injection (ICSI) and parthenogenetic activation, do not lead to the estab- 
lishment of an effective membrane polyspermic block’. Consistent with 
a role for Juno in the membrane block to polyspermy, eggs fertilized 
either by ICSI, or parthenogenetically activated by ethanol, did not lose 
cell-surface expression of Juno (Fig. 4c). The rapid shedding of Juno 
from the egg membrane within vesicles after fertilization therefore 
provides a possible mechanism for the membrane block to polyspermy 
in mammalian eggs. 


Discussion 

To our knowledge, the discovery of Juno as the binding partner for 
Izumo] identifies these proteins as the first cell-surface receptor pair 
essential for gamete recognition in any organism. Importantly, both 
female Juno and male Izumo1-deficient mice are infertile, suggesting 
that the interaction is essential for normal fertilization. Comparing our 
findings with recent advances in the molecular understanding of cel- 
lular fusion in other biological contexts suggests that the Izumol-Juno 
interaction performs a necessary adhesion step rather than acting as a 
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membrane ‘fusogen’. Unlike established fusogens such as the EFF-1 and 
AFF-1 proteins” or syncytins”, that are sufficient for fusion, Juno and 
Izumol do not induce the formation of syncytia when ectopically 
expressed on neighbouring non-fusing cells. Essential adhesion pro- 
cesses are required in other cellular fusion systems such as myofibre 
formation in both Drosophila” and vertebrates”, suggesting that mem- 
brane fusion requires the action of other membrane proteins such as 
myomaker*’. The expected but remarkably low in vitro binding affinity 
between soluble monomeric Juno and Izumo] suggests that local clus- 
tering of Juno within the oolemma is important to ensure a sufficiently 
high binding avidity for productive interactions, a role that could be per- 
formed by the tetraspanin CD9. Tetraspanins are known to organize other 
membrane proteins within microdomains”, and quantitative differences 
in sperm adhesion to CD9-deficient eggs have been reported’’. This 
may also explain why female mice lacking CD9 have reduced fertility>’, 
unlike Juno-deficient mice which are completely infertile. The ability 
of CD9 to organize Juno within the oolemma may be indirect because 
oocytes lacking CD9 have an altered membrane architecture**”’. 

Several pieces of evidence point to the rapid loss of Juno from the 
egg membrane after fertilization as the mechanistic basis for the mem- 
brane block to polyspermy in mammals. First, cell-surface Juno is 
essential for fertilization; second, it is undetectable on the oolemma 
approximately 40 min after fertilization, in excellent agreement with 
the timing of the membrane bloc 36, third, the membrane block is both 
a graded response” and associated with the loss of sperm binding sites”*, 
consistent with the gradual loss ofa surface receptor; and fourth, surface 
Juno expression is not lost in ICSI-fertilized or parthenogenetically- 
activated eggs that do not form an effective membrane block”. One 
further possibility is that because Juno is shed as vesicles after fertiliza- 
tion, this creates a zone of ‘decoy eggs’ confined within the perivitelline 
space that could bind to and efficiently neutralise incoming acrosome- 
reacted sperm to increase the potency of the sperm block. In conclusion, 
our discovery of an extracellular receptor pair essential for fertilization 
provides a focus for the rational development of novel contraceptives 
and fertility treatments. 


METHODS SUMMARY 

Transgenic mice. All animal experiments were performed in accordance with local 
ethical and UK Home Office regulations. Juno-deficient mice (Folr4tm KOMP) Wtsiy 
were generated by the Sanger Institute Mouse Genetics Project from Folr4-targeted 
mouse embryonic stem cells as part of the International Knockout Mouse Con- 
sortium, essentially as described*’. Acrosome reporter mice (B6;B6C3-Tg(Acro3- 
EGEP)010sb)*!, where the enhanced green fluorescent protein labels acrosome-intact 
sperm, were obtained from the RIKEN BioResource Centre. Wild-type eggs were 
obtained from BJCB strain mice for IVF, parthenogenetic activation and ICSI pro- 
cedures. Sibling C57BL6/6N wild-types were used as controls in all other experiments. 
In vitro fertilization assays and ICSI. IVF and ICSI were performed as previously 
described’. Eggs were collected from hormone-primed females and sperm were 
capacitated before mixing. Fertilization and egg activation were assessed by stain- 
ing sperm DNA and egg chromosomes. Acrosome-reacted sperm were identified 
using Acro3-EGFP reporter mice. Juno expression analysis was performed by immu- 
nofluorescence and images acquired with a Leica SP5 confocal microscope. Fertil- 
ization was quantified by counting decondensed sperm heads in the egg cytoplasm, 
or scoring the number of pronuclei. 

Recombinant protein production and expression cloning. All protein expres- 
sion constructs were made by gene synthesis (GeneArt). Protein production, puri- 
fication, AVEXIS assays and SPR were performed essentially as described”**. The 
Juno cDNA was identified by iterative rounds of selection by transfecting adherent 
HEK293T cells with a commercially available mouse oocyte cDNA library, NIH_ 
MGC_257_N (Express Genomics, USA). 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 9 January; accepted 5 March 2014. 
Published online 16 April 2014. 


1. Okabe, M. The cell biology of mammalian fertilization. Development 140, 
4471-4479 (2013). 


486 | NATURE | VOL 508 | 24 APRIL 2014 


16. 


17. 


18. 


19. 


20. 


21. 


22. 
23. 


24. 


25. 
26. 


27. 


28. 


29. 


30. 


31. 


32. 
33. 


34. 


35. 


36. 
Si. 


. Teng, M. W. et al. 


Gardner, A. J. & Evans, J. P. Mammalian membrane block to polyspermy: new 
insights into how mammalian eggs prevent fertilisation by multiple sperm. Reprod. 
Fertil. Dev. 18, 53-61 (2006). 

Ikawa, M., Inoue, N., Benham, A. M. & Okabe, M. Fertilization: a sperm’s journey to 
and interaction with the oocyte. J. Clin. Invest. 120, 984-994 (2010). 

Inoue, N., Ikawa, M., lsotani, A. & Okabe, M. The immunoglobulin superfamily 
protein Izumo is required for sperm to fuse with eggs. Nature 434, 234-238 
(2005). 

Le Naour, F., Rubinstein, E., Jasmin, C., Prenant, M. & Boucheix, C. Severely reduced 
female fertility in CD9-deficient mice. Science 287, 319-321 (2000). 

Miyado, K. etal. Requirement of CD9 on the egg plasma membrane for fertilization. 
Science 287, 321-324 (2000). 
Kaji, K. et al. The gamete fusion process is defective in eggs of Cd9-deficient mice. 
Nature Genet. 24, 279-282 (2000). 

Satouh, Y., Inoue, N., |kawa, M. & Okabe, M. Visualization of the moment o 
sperm-egg fusion and dynamic localization of IZUMO1. J. Cell Sci. 125, 
4985-4990 (2012). 

Inoue, N. et al. Molecular dissection of IZUMO1, a sperm protein essential for 
sperm-egg fusion. Development 140, 3221-3229 (2013). 


mouse 


. Coonrod, S.A. etal. Treatment of mouse oocytes with PI-PLC releases 70-kDa (p/ 5) 


and 35- to 45-kDa (p/ 5.5) protein clusters from the egg surface and inhibits 
sperm-oolemma binding and fusion. Dev. Biol. 207, 334-349 (1999). 


. Alfieri, J. A. etal. Infertility in female mice with an oocyte-specific knockout of GPI- 


anchored proteins. J. Cell Sci. 116, 2149-2155 (2003). 


. Wright, G. J. Signal initiation in biological systems: the properties and detection of 


transient extracellular protein interactions. Mol. Biosyst 5, 1405-1412 (2009). 


. Yamaguchi, T. et al. Control of immune responses by antigen-specific regulatory T 


cells expressing the folate receptor. /mmunity 27, 145-159 (2007). 
ultiple antitumor mechanisms downstream of prophylactic 
regulatory T-cell depletion. Cancer Res. 70, 2665-2674 (2010). 


. Liang, S.C., Moskalenko, M., Van Roey, M. & Jooss, K. Depletion of regulatory T cells 


by targeting folate receptor 4 enhances the potency of a GM-CSF-secreting tumor 
cell immunotherapy. Clin. Immunol. 148, 287-298 (2013). 

Kunisawa, J., Hashimoto, E., Ishikawa, I. & Kiyono, H. A pivotal role of vitamin B9 in 
the maintenance of regulatory T cells in vitro and in vivo. PLoS ONE 7, e32094 
(2012). 
Kinoshita, M. et al. Dietary folic acid promotes survival of Foxp3* regulatory T cells 
in the colon. J. /mmunol. 189, 2869-2878 (2012). 

Salbaum, J. M., Kruger, C. & Kappen, C. Mutation at the folate receptor 4 locus 
modulates gene expression profiles in the mouse uterus in response to 
periconceptional folate supplementation. Biochim. Biophys. Acta 1832, 
1653-1661 (2013). 

Chen, C. et a/. Structural basis for molecular recognition of folic acid by folate 
receptors. Nature 500, 486-489 (2013). 

Wibowo, A. S. et al. Structures of human folate receptors reveal biological 
trafficking states and diversity in folate and antifolate recognition. Proc. Nat! Acad. 
Sci. USA 110, 15180-15188 (2013). 

Jia, Z. et al. A novel splice variant of FR4 predominantly expressed in CD4*CD25* 
regulatory T cells. /mmunol. Invest. 38, 718-729 (2009). 

Ellerman, D. A. et a/. Izumo is part of a multiprotein family whose members form 
large complexes on mammalian sperm. Mol. Reprod. Dev. 76, 1188-1199 (2009). 
Bushell, K. M., Sollner, C., Schuster-Boeckler, B., Bateman, A. & Wright, G. J. 
Large-scale screening for novel low-affinity extracellular protein interactions. 
Genome Res. 18, 622-630 (2008). 
Jaffe, L. A. Fast block to polyspermy in sea urchin eggs is electrically mediated. 
Nature 261, 68-71 (1976). 
Sabharanjak, S. & Mayor, S. Folate receptor endocytosis and trafficking. Adv. Drug 
Deliv. Rev. 56, 1099-1109 (2004). 
Jackowski, S. & Dumont, J. N. Surface alterations of the mouse zona pellucida and 
ovum following in vivo fertilization: correlation with the cell cycle. Biol. Reprod. 20, 
50-161 (1979). 

Podbilewicz, B. et al. The C. elegans developmental fusogen EFF-1 mediates 
homotypic fusion in heterologous cells and in vivo. Dev. Cell 11, 471-481 (2006). 
Mi, S. et al. Syncytin is a captive retroviral envelope protein involved in human 
placental morphogenesis. Nature 403, 785-789 (2000). 

Abmayr, S. M. & Pavlath, G. K. Myoblast fusion: lessons from flies and mice. 
Development 139, 641-656 (2012). 

Powell, G. T. & Wright, G. J. Jamb and Jamc are essential for vertebrate myocyte 
fusion. PLoS Biol. 9,e1001216 (2011). 

illay, D. P. et al. Myomaker is a membrane activator of myoblast fusion and 
muscle formation. Nature 499, 301-305 (2013). 

Hemler, M. E. Tetraspanin functions and associated microdomains. Nature Rev. 
Mol. Cell Biol. 6, 801-811 (2005). 

Jégou, A. et al. CD9 tetraspanin generates fusion competent sites on the egg 
membrane for mammalian fertilization. Proc. Nat! Acad. Sci. USA 108, 
10946-10951 (2011). 

Runge, K. E. et al. Oocyte CD9 is enriched on the microvillar membrane and 
required for normal microvillar shape and distribution. Dev. Biol. 304, 317-325 
(2007). 
Zytkiewicz, E., Nowakowska, J. & Maleszewski, M. Decrease in CD9 content and 
reorganization of microvilli may contribute to the oolemma block to sperm 
penetration during fertilization of mouse oocyte. Zygote 18, 195-201 (2010). 
Wolf, D. P. The block to sperm penetration in zona-free mouse eggs. Dev. Biol. 64, 
1-10 (1978). 

Gardner, A. J., Williams, C. J. & Evans, J. P. Establishment of the mammalian 
membrane block to polyspermy: evidence for calcium-dependent 

and -independent regulation. Reproduction 133, 383-393 (2007). 


©2014 Macmillan Publishers Limited. All rights reserved 


38. 


39. 


40. 


41. 


42. 


43. 


Horvath, P. M., Kellom, T., Caulfield, J. & Boldt, J. Mechanistic studies of the plasma 
membrane block to polyspermy in mouse eggs. Mol. Reprod. Dev. 34, 65-72 
(1993). 

Wortzman-Show, G. B., Kurokawa, M.., Fissore, R. A. & Evans, J. P. Calcium and 
sperm components in the establishment of the membrane block to polyspermy: 
studies of ICSI and activation with sperm factor. Mol. Hum. Reprod. 13, 557-565 
(2007). 

Skarnes, W. C. et al. A conditional knockout resource for the genome-wide study of 
mouse gene function. Nature 474, 337-342 (2011). 

Nakanishi, T. et a/. Real-time observation of acrosomal dispersal from 

mouse sperm using GFP as a marker protein. FEBS Lett. 449, 277-283 

(1999). 

Nagy, A., Gertsenstein, M., Vintersten, K. & Behringer, R. Manipulating the Mouse 
Embryo: A Laboratory Manual 3rd edn (Cold Spring Harbor Laboratory Press, 
2003). 

Kimura, Y. & Yanagimachi, R. Intracytoplasmic sperm injection in the mouse. Biol. 
Reprod. 52, 709-720 (1995). 


ARTICLE 


44. Sun, Y., Gallagher-Jones, M., Barker, C. & Wright, G. J. A benchmarked protein 
microarray-based platform for the identification of novel low-affinity extracellular 
protein interactions. Anal. Biochem. 424, 45-53 (2012). 


Acknowledgements This work was supported by the Wellcome Trust grant number 
098051. Juno-deficient mice were generated by the Sanger Institute Mouse Genetics 
Project. We thank W. Skarnes and J. Bussell for advice on transgenic mice; J. Kerr for 
construct design; A. Bradley and L. Jovine for helpful comments on the manuscript; 
and M. Okabe and N. Inoue for the OBF 13 antibody. 


Author Contributions E.B. and G.J.W. conceived the project, designed and analysed the 
experiments, and wrote the manuscript. E.B. performed all experiments with technical 
help from B.D. (ICSI) and D.G. (electron microscopy). 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to G.J.W. (gw2@sanger.ac.uk). 


24 APRIL 2014 | VOL 508 | NATURE | 487 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature13151 


Origins and functional evolution of Y 
chromosomes across mammals 


Diego Cortez!?, Ray Marin*, Deborah Toledo-Flores’, Laure Froidevaux', Angélica Liechti', Paul D. Waters*, Frank Griitzner? 


& Henrik Kaessmann!? 


Y chromosomes underlie sex determination in mammals, but their repeat-rich nature has hampered sequencing and asso- 
ciated evolutionary studies. Here we trace Y evolution across 15 representative mammals on the basis of high-throughput 
genome and transcriptome sequencing. We uncover three independent sex chromosome originations in mammals and 
birds (the outgroup). The original placental and marsupial (therian) Y, containing the sex-determining gene SRY, emerged 
in the therian ancestor approximately 180 million years ago, in parallel with the first of five monotreme Y chromosomes, 
carrying the probable sex-determining gene AMH. The avian W chromosome arose approximately 140 million years ago in 
the bird ancestor. The small Y/Wgene repertoires, enriched in regulatory functions, were rapidly defined following strati- 
fication (recombination arrest) and erosion events and have remained considerably stable. Despite expression decreases 
in therians, Y/W genes show notable conservation of proto-sex chromosome expression patterns, although various Y genes 
evolved testis-specificities through differential regulatory decay. Thus, although some genes evolved novel functions through 
spatial/temporal expression shifts, most Y genes probably endured, at least initially, because of dosage constraints. 


In most mammals, Y chromosomes are required to override the program 
underlying development of the default sex, females’. Extant mammals 
possess an XY (male heterogametic) sex chromosome system, with rare 
exceptions that experienced secondary XY loss’, but sex chromosomes 
evolved from different autosomes in therians (placentals and marsu- 
pials) and egg-laying monotremes (Fig. 1). Therians share the same XY 
system, whereas monotremes have multiple X and Y chromosomes that 
are partially homologous to bird sex chromosomes**, where females are 
heterogametic (ZW system). Sex chromosome differentiation occurred 
through recombination arrests along the Y, leading to reduced selection 
and associated gene decay and repeat accumulation®’. Consequences of 
Y deterioration for X chromosome evolution were recently investigated’, 
but the fact that this chromosome is refractory to assembly owing to its 
repeat-rich nature’ have hindered evolutionary studies of the Y itself. 
Nevertheless, dedicated efforts determined complete Y sequences in three 
primates”-** (human, chimpanzee, macaque) and large portions of two 
carnivore (dog and cat) Y chromosomes”. Together with smaller scale 
work, these studies provided initial clues to Y evolution, such as the stabili- 
zation of Y gene content through ampliconic sequence accumulation’*"?”. 
However, our understanding of mammalian Y chromosome evolution 
remains limited owing to the restricted amount and phylogenetic rep- 
resentation of available Y chromosome data. 


Mammalian Y gene repertoires 


To explore Y evolution, we developed a subtraction approach that directly 
targets and assembles exons of the male-specific (non-pseudoautosomal) 
Y chromosome (MSY) on the basis of high-throughput sequencing of 
transcriptomes and genomes from both sexes (Extended Data Fig. 1 
and Methods). In brief, male transcripts were assembled from male- 
specific RNA sequencing (RNA-seq) reads not mapping onto female 
reference genomes. Y identity was confirmed using the whole-genome 
sequence data; thatis, true Y transcripts are supported by genomic reads 
unique to males. Genes with no or low expression were detected by screening 


male-specific genomic data with Y orthologues from other species. The 
genomic data also served to support the absence of ancestral Y genes (that 
is, their evolutionary loss). We validated our approach using large-scale 
PCR/Sanger sequencing-based screening of male/female genomic DNA 
and published Y chromosomes, thus confirming that complete coding 
sequences of all Y genes (with the potential exception of those not expressed 
in the sampled tissues and lacking a known Y orthologue) could be deduced 
for a given species. 

We applied our procedure to sequencing data that we collected for 
ten mammals (Supplementary Tables 1-4 and Methods). Together with 
available Y sequences’”’*, we could thus investigate Y evolution in 15 
species covering all major lineages of the class Mammalia" (placentals 
or eutherians, marsupials, monotremes), all but one (Xenarthra) of the 
eutherian superorders”™, the two marsupial superorders (American and 
Australian marsupials), both extant monotreme families (platypus, 
echidna), and all major groups of ‘higher primates’ (that is, simians: 
the great apes, Old World monkeys (OWMs) and New World mon- 
keys (NWMs)). For comparison, we produced similar data for the bird 
(chicken) W chromosome. 

We identified a total of 134 different Y protein-coding genes in the 
ten new species, thus approximately doubling the number of previously 
known Y genes (Fig. 1 and Extended Data Fig. 3; note that the letter Y 
is added at the end of gene symbols for all genes for which Y-linkage is 
detected, following common practice), and 214 distinct pseudogenes 
and noncoding RNAs (Supplementary Tables 5-17 and Supplementary 
Data 1). Our read coverage analysis (Methods) predicts 0-6 multi-copy 
protein-coding genes per species, some of which are shared across species 
and thus presumably belong to conserved ampliconic Y regions"! (Fig. 1 
and Supplementary Tables 5-17). Most of the 155 distinct noncoding 
RNAs seem to stem from ampliconic regions, with two or more ident- 
ical copies per locus (Supplementary Tables 5-17). We recovered all of 
the recently identified chicken W protein-coding genes’*'® and added 
11 distinct noncoding RNAs (Fig. 1 and Supplementary Table 18). 
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Figure 1 | Y (W) protein-coding gene repertoires and their origins. Top, 
Y/W genes and strata (genes are ordered within strata based on ds). Genes 
added to differentiated sex chromosomes (‘Added’) and independently 
recruited genes (red text) are indicated. For echidna and ostrich, only genes 
shared with sister species (platypus and chicken, respectively) were identified 
on the basis of genomic sequencing data. For bird strata age estimations, turkey 
and zebra finch Z gametologues were included. Bottom, phylogenetic tree of the 
19 species. Lineages, divergence times and origins of strata and genes are 
indicated. For therian strata, monotreme S1 and bird S1, refined age estimates 


Origins of therian Y genes 


We next assessed the time at which Y genes began to evolve indepen- 
dently of their X counterpart (gametologue), or when they were recruited 
(through transposition/translocation) to already differentiated Y regions 
from autosomes or the X. We thus reconstructed phylogenetic trees for 
all Y genes, X gametologues, and autosomal orthologues from outgroup 
species with different sex chromosomes (Extended Data Fig. 4, Sup- 
plementary Data 1 and Methods). In these trees, the location of nodes 
reflecting the divergence of Y and X clades, relative to lineage divergence 
nodes, mark the phylogenetic position at which a given Y gene emerged. 
We assessed divergence rates at synonymous sites (ds) before/after dif- 
ferentiation of Y/X gametologues for further refinement of ages (Extended 
Data Fig. 5a-f). 

Y chromosomes consist of distinct regions (‘strata’) that stopped 
recombining at specific time points'®’”. Our phylogenetic dating of the 
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on branches are based on dg trees (Extended Data Fig. 5a-f). More approximate 
placement on branches for platypus $2-S5 and chicken S2-S3 (genes for which 
strata could not be assigned with confidence are listed as undefined, SU) are 
based on pairwise (median) ds values of stratum genes (dashed lines). Note 
that although actual Y chromosome assemblies underlie the previously 
published’* dog/cat Y repertoires shown here, the repertoires may be 
incomplete, because these assemblies (in particular for cat) are incomplete’’. 
+, multi-copy genes; y, pseudogenes; &, intact genes in chimpanzee formerly 
annotated as pseudogenes; A&B 1&2, known duplicates; YAR, Y-added region. 


original eutherian stratum 1 (S1), which, like 2, stems from the same 
ancestral proto-sex chromosome portion (the “Y/X-conserved’ region, 
YCR/XCR; ref. 18), shows that it originated in the last common therian 
ancestor (Fig. 1 and Extended Data Fig. 4a; see the following external 
link for individual trees: ftp://ftp.vital-it.ch/papers/kaessmann/Nature- 
Cortez/Cortez_etal_Nature_YX_gene_trees_alignments.zip), confirm- 
ing that sex chromosomes of humans and other therians emerged after 
the therian—-monotreme split** and were never common to all mam- 
mals, as implied by some previous work’*”’. Notably, therian sex chro- 
mosome origination occurred just before the eutherian—marsupial split 
~180 million years (Myr) ago (Fig. 1 and Extended Data Fig. 5a), consis- 
tent with our previous hypothesis**. $1 contains a common set of four genes, 
including the sex-determining gene SRY', which has been maintained 
since sex chromosome inception in most therians, although some S1 genes 
apparently became dispensable in some species (Fig. 1). Interestingly, 


24 APRIL 2014 | VOL 508 | NATURE | 489 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


marsupials retained twice the number of ancestral S1 genes. Finally, it 
is noteworthy that ds for Y gametologues substantially increased rela- 
tive to that of X counterparts in therians following S1 differentiation 
(Benjamini-Hochberg-corrected P < 0.05, Mann-Whitney U-test; Ex- 
tended Data Figs 5g and 7a-d), suggesting that male mutation bias 
(that is, a higher mutation rate in males than in females owing to a higher 
number of germline cell divisions), previously reported for eutherians”, 
has shaped genomes of therians since their emergence. 

S2 arose ~117 Myr ago in the eutherian ancestor, after the split from 
the marsupial lineage (Extended Data Figs 4b and 5b), consistent with 
some previous studies” but contrary to others’’. In marsupials an inde- 
pendent 82, containing twice the number of eutherian S2 genes, arose 
in the marsupial ancestor ~37 Myr before the eutherian S2 (Extended 
Data Figs 4e and 5c). Thus, the therian pseudoautosomal region (PAR) 
shrank at different rates in eutherians and marsupials, involving the con- 
vergent differentiation of two genes (KDM5D, UBE1Y]1 (also known as 
UBALY)), as previously proposed”’ (Fig. 1). 

Our analysis of S3, a large autosomal addition to the already estab- 
lished sex chromosomes’ (the ‘Y/X-added region’), shows that it con- 
tained seven genes and was already defined in the common eutherian 
ancestor (Fig. 1 and Extended Data Fig. 4c), as could previously only 
be surmised’®"”. S3 only differentiated just before the placental mam- 
mal radiation ~116 Myr ago, at about the same time or perhaps even 
concomitantly with S2 (for example, as part of the same inversion event; 
Extended Data Fig. 5d), implying that the eutherian MSY consisted 
merely of $1 for ~60 Myr following the eutherian—marsupial split. As 
for S1/S2, S3 gene content has been remarkably stable during evolution 
(Fig. 1). However, curiously, similarly to $1 (HSFY, RPS4Y), rodents and 
marmosets independently lost the same two S3 genes (TMSB4Y, Cyorf15Y), 
whereas TMSB4Y was independently also lost in Laurasiatheria” (Fig. 1). 
It is also noteworthy that the elephant retained six more ancestral $1- 
S3 genes (total 15) than the marmoset (total nine), the eutherian with 
the smallest detected S1-S3 gene repertoire. 

Further analyses show that all genes of S4. and S5, as previously defined"®, 
originated in the common catarrhine (OWM/great ape) ancestor ~25- 
40 Myr ago, except for one gene (TBL1Y), which presumably emerged 
in the common simian ancestor >40 Myr ago (Fig. 1). The marmoset 
Y acquired a unique gene, XG (Fig. 1), which spans the human PAR 
boundary” and remains in the elephant PAR region”. Furthermore, 
AMELY, also residing near the PAR boundary, probably emerged inde- 
pendently in primates and Laurasiatheria~’. MBTPS2Y arose as part of 
an independent S4 in the afrotherian lineage, and MED14Y represents 
a recent acquisition in rats (Fig. 1). Primate Y chromosomes recruited 
additional genes through various (retro)transpositions/translocations 
from autosomes during recent evolution'*"' (Fig. 1). These results illus- 
trate the dynamic recent, partly convergent, evolution of the eutherian Y. 

Although extant gene numbers are small, the above results illustrate 
that many Y genes endured over long evolutionary time periods. Mod- 
elling of Y gene loss dynamics revealed that S1-S3 gene decay proceeded 
rapidly upon differentiation, at similar rates as previously estimated"®, 
and then markedly levelled off when small but apparently essential gene 
repertoires were defined (Extended Data Fig. 8c). 


Monotreme Y biology and evolution 


Gene content and evolution of monotreme Y chromosomes (five in platy- 
pus, four in echidna owing to a Y;-Y; fusion”*) previously remained 
largely unknown, and the X chromosomes of platypus, the only mono- 
treme with an assembled genome”, are only partly reconstructed. To 
assess monotreme sex chromosome evolution, we dated the 25 iden- 
tified monotreme Y protein-coding genes (Fig. 1) and X gametologues. 
Because no X gametologue is assigned to any currently assembled X, 
we confirmed X identity for 23 out of 25 gametologues and assessed pro- 
bable locations on the basis of male/female genomic read coverage and 
synteny mapping (Extended Data Fig. 6, Supplementary Tables 19, 20 and 
Methods), revealing that monotreme gametologues stem from various 
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ancestral synteny blocks (Supplementary Tables 19, 20) that assem- 
bled during evolution through genomic rearrangements’. 

We detected six potential Y chromosome strata, five of which emerged 
in the platypus—echidna ancestor (Fig. 1 and Extended Data Fig. 5h). 
Thus, nearly all (22 out of 25) platypus Y genes differentiated at least 
~50 Myr ago in the common monotreme ancestor (Supplementary Data 1, 
external data link). Importantly, monotreme S1 originated early during 
monotreme evolution ~175 Myr ago (Extended Data Figs 4f and 5e), 
which, together with our dating of therian sex chromosome origination, 
implies two independent yet essentially concomitant sex chromosome 
origination events in mammals, rules out the possibility of a turnover 
of one mammalian sex chromosome system into the other, and raises 
the question of the nature of sex determination in the common mam- 
malian ancestor more than ~200 Myr ago. 

Notably, one of the S1 genes is AMH, encoding the anti-Miillerian hor- 
mone, a key component of sex determination cascades across vertebrates)’, 
which in eutherians blocks the development of female reproductive 
organs upon activation by SOX9, which in turn is activated by SRY’. 
Our observation that AMH is Y-linked in monotremes and part of the 
oldest stratum, S1 (Fig. 1 and Extended Data Fig. 5h), together with find- 
ings that AMH may precede SOX9 expression and act as the primary 
sex-determining trigger in some vertebrates” ”’, render this gene a prime 
candidate for being a principal monotreme sex-determining gene. We 
predicted AMHX to be located on chromosome X, (Supplementary 
Tables 19, 20) and AMHY to be part of Ys, on the basis of our synteny 
approach and the recent mapping” of the $1 gene MED26Y (Fig. 1), 
which similarly to AMHY stems from an ancestral linkage group cor- 
responding to chicken chromosome 28. Notably, we confirmed that 
AMHY is located on Y; by physical mapping (Extended Data Fig. 7e, f 
and Methods). Our results favour a scenario in which a fusion of an 
ancestral chromosome segment containing AMH, presumably to the 
original proto-Y5-Xs, formed the initial monotreme proto-sex chro- 
mosomes. Their differentiation was followed by various translocations/ 
fusions with autosomes” and subsequent differentiation events, which 
ultimately led to the current Y genes, of which only two genes (FEMICY 
and HNRNPKY) were derived from the initial Z-homologous proto-Y; 
portion. Curiously, in parallel to eutherians, monotreme Y chromosomes 
also acquired DAZ and SLY homologous genes (Fig. 1). 

Finally, contrary to our therian observations, we could not detect sig- 
nificantly different ds values between monotreme Y and X gametolo- 
gues (Extended Data Fig. 5g), which indicates that male mutation bias” 
is lacking or limited in monotremes. 


W origin 

Next, we performed the first phylogenetic dating of chicken W strata 
(Fig. 1 and Supplementary Data 1, external data link), taking advantage 
of Z orthologues from recent bird genome sequences*'”” and male/female 
RNA-seq data from ostriches* (allowing the extraction of ostrich W 
genes), representing the most basal avian lineage, ratites. Contrary to 
previous work”, this analysis reveals an initial stratum with two genes 
(HNRNPK, KCMF1) that emerged in the common bird ancestor ~140 Myr 
ago (Extended Data Figs 4g and 5f), whereas the remaining chicken W 
strata seem to have originated later during avian evolution (Fig. 1), 
although a more precise phylogenetic dating of these strata will require 
W sequences from other birds. KCMF1 and HNRNPK, which was inde- 
pendently retained on both the W and monotreme Y (reflecting their 
shared ancestry; Fig. 1), are widely expressed housekeeping genes**”° 
that do not represent obvious sex-determination candidates but rather 
dosage-sensitive genes that were preserved to maintain proto-Z/W gene 
dose. Our results are consistent with a Z dosage-based mechanism of 
sex determination in birds’>*’. 


Functional evolution of Y (W) genes 

To understand why only small specific subsets of 9-25 Y/W protein- 
coding genes have been preserved from the original proto-sex chro- 
mosome repertoires, we assessed whether they possess characteristic 
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functions using simulations, which started with the ancestral sets of 
proto-sex-linked genes and then randomly removed genes until cur- 
rent Y/W gene numbers were reached (Methods). We then compared 
the functions of the simulated gene sets to those observed using Gene 
Ontology (GO) annotations**. Notably, this analysis revealed that highly 
non-random gene sets with similar functions were maintained across 
the different Y/W chromosomes (corrected P < 0.01, one-tailed alpha 
test). Current Y/W genes are enriched for genes involved in transcrip- 
tion/transcription regulation (GO Biological Process) and specific 
DNA binding/transcription factor activity (GO Molecular Function) 
(Supplementary Tables 21, 22), which is consistent with recent observa- 
tions in fruitflies*’ and indicates that current Y genes were, at least initially, 
preserved to maintain ancestral gene dosage’’, given that regulatory 
genes and genes with binding functions are often haploinsufficient®. 

Expression levels of autosomal orthologues of Y/X genes in species 
with different sex chromosome systems, unaffected by sex-related selec- 
tion, may serve to gauge proto-sex chromosome expression levels’. 
Expression levels inferred for proto-sex chromosomal precursors of 
Y/X gametologues using this procedure are higher than those of other 
proto-sex chromosome genes in all amniote lineages (Extended Data 
Fig. 8a), conforming to the above notion that current Y genes derive 
from highly expressed genes with universal (dosage-sensitive) functions. 
Consistently, X/Z gametologues, which are expected to generally pre- 
serve ancestral functions*, show higher expression levels than other 
X/Z-linked genes (Extended Data Fig. 8a). 

Expression levels of therian Y genes decreased since sex chromosome 
differentiation (corrected P< 0.05, Mann-Whitney U-test), with cur- 
rent (median) expression levels that are between 3.1 (opossum) to 15 
(rodents) times lower than inferred ancestral expression levels of single 
proto-sex chromosome alleles (Fig. 2), reflecting partial regulatory decay 
of Y genes and/or evolution towards new functions. Further analyses 
show that many therian Y genes maintained ancestral patterns of usu- 
ally ubiquitous expression, with interesting exceptions such as SRY and 
AMHY, which appear to have been sex-biased already on the proto-sex 
chromosomes (Fig. 3). Nevertheless, a larger number of genes evolved 
new expression patterns on the Y than on the X or autosomes (corrected 
P<0.05, Fisher’s exact test, Supplementary Table 23). All of these Y genes 
evolved testis-specific expression (Fig. 3), consistent with the male-limited 
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transmission of the Y''. Notably, therian Y genes evolved testis specifi- 
city by experiencing substantially stronger expression reduction in somatic 
tissues than in testis (P < 0.05, Mann-Whitney U-test; Extended Data 
Fig. 8b). Thus, testis specificity evolved through differential regulatory 
decay rather than upregulation in testis, as observed in fruitflies. Con- 
trary to therians, platypus Y and chicken W genes show no overall expres- 
sion reduction relative to their proto-sex precursors (Fig. 2), and most 
of them also preserved ancestral spatial expression patterns (Fig. 3). Thus, 
Y/W expression preservation is particularly pronounced in monotremes 
and birds. 

To explore cellular functions of therian testis-specific Y genes, we used 
the mouse as a model, assessing expression patterns of mouse Y genes 
across all major testicular cell types using dedicated data*’. Contrary 
to ubiquitously expressed Y genes, testis-specific Y transcripts are re- 
stricted to three germ-cell types: mitotic germ cells (spermatogonia), 
meiotic cells (spermatocytes) and, especially, post-meiotic cells (round 
spermatids) (Extended Data Fig. 9a). Although the high expression levels 
of these genes in spermatids may partly reflect ‘promiscuous’ tran- 
scription, facilitated by overall open chromatin“, it may also indicate 
post-meiotic functions, consistent with the finding that the spermatid- 
specific SLY gene (Extended Data Fig. 9a) is a sex chromosome regu- 
lator in spermatids*”. Detected noncoding and pseudogenic Y transcripts 
are often testis specific across species and sometimes have many genomic 
copies (Supplementary Tables 5-17). Notably, nearly all noncoding mouse 
Y transcripts are specifically expressed in spermatids/spermatocytes 
(Extended Data Fig. 9b), probably mainly due to the permissive chromatin 
environment*’. The lower abundance of transcripts in spermatocytes 
relative to spermatids is presumably due to meiotic sex chromosome 
inactivation, which counteracts transcriptional promiscuity*’. 


Overall Y evolution and selective pressures 

To further characterize selective signatures of Y evolution, we calcu- 
lated branch-specific rates of nonsynonymous (dy) and synonymous 
(ds) substitutions, which revealed substantially increased dy/dg values 
for Y/W genes compared to both X/Z gametologues and autosomal 
precursors, consistent with previous work*** (Extended Data Fig. 10a). 
Y branch dy/dg is always statistically significantly below 1 and we find no 
evidence for positive selection, neither for the most basal Y gene branches 
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Figure 2 | Expression level evolution on amniote sex chromosomes. 

a-d, Expression level distributions (based on medians across somatic tissues or 
testis) of Y genes (Y); X genes in males (Xm) and females (Xf); precursors of X/Y 
genes on proto-sex chromosomes in males (Pm) and females (Pf). e, Similar 
distributions for (proto-)sex chromosomes in chicken. Note: for proto-sex 
chromosome plots, inferred expression output values were calculated per single 
gene copy/allele, to assess conservation of ancestral expression levels in current 
single Y (W) chromosomes. Significant differences (Mann-Whitney U-test): 


Y genes sorted 


Y genes sorted W genes sorted 


Benjamini—Hochberg-corrected *P < 0.05, **P < 0.01. Error bars, maximum 
and minimum values, excluding outliers. f-j, Median current/ancestral 
expression level ratios for individual genes. Error bars, 95% confidence 
intervals. Ratios are plotted on a log) scale, that is, a ratio of 0 (non-log ratio 
of 1, red line) indicates that current and ancestral expression levels are similar. 
R? statistics represents the best fit to a third-order exponential curve. Gene 
numbers (7) underlying the data are indicated. 
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nor when combining descendant branches following gametologue dif- 
ferentiation (Methods). Thus, although some Y genes may have been 
shaped by positive selection in certain species’***, Y chromosome evo- 
lution is generally characterized by relaxed purifying selection, consis- 
tent with the impaired selection on the MSY”****. Thus, Y genes probably 
generally preserved ancestral protein functions, consistent with the 
observation that even SRY may be functionally replaced by its X coun- 
terpart and functional precursor SOX3 (ref. 47) and that most X genes 
likely underlying Turner syndrome (impaired development due to X 
chromosome monosomy) not only escape X inactivation and are under 
strong purifying selection but also have conserved Y gametologues”. 
Together with our expression and simulation results, these considera- 
tions suggest that most Y genes were, at least initially, preserved because 
of (regulatory) dosage constraints. Some Y genes then evolved new func- 
tions in spermatogenesis or development, primarily through temporal and/ 
or spatial expression shifts, as exemplified by the therian sex-determining 
gene, SRY. 


METHODS SUMMARY 


We collected Illumina RNA-seq data (polyadenylated RNA fraction) for 166 tissue 
samples from nine mammals and two birds, and high-throughput Illumina paired- 
end genomic sequencing data for males and females from nine mammals and chicken. 
We developed a subtraction approach based on male/female RNA-seq data, Illumina 
genomic data and available genomes to identify and assemble Y (W) transcripts. 
The approach was validated using large-scale PCR and Sanger sequencing. We used 
genomic read coverage to predict multi-copy Y (W) genes, and male/female read 
coverage differences to identify X-linked genes in platypus. Phylogenetic tree recon- 
struction of sex-chromosome-linked genes and 1:1 autosomal orthologues from 
outgroup species, and associated synonymous substitutions rate analyses, were 
used to estimate ages of sex chromosome stratification events. We performed Monte 
Carlo simulations to assess non-random overlaps of Y (W) gene functions across 
different sex chromosome systems. RNA-seq read mapping and expression level 
estimation were performed using standard procedures and reference genomes to 
which newly identified Y (W) genes were added. To infer ancestral expression levels 
of Y (W) and X (Z) genes, we used 1:1 autosomal orthologues from outgroup species 
with different sex chromosome systems. All newly established Y and W sequences 
are provided in Supplementary Data 1. 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Mammalian Y chromosomes retain widely 
expressed dosage-sensitive regulators 
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The human X and Y chromosomes evolved from an ordinary pair of autosomes, but millions of years ago genetic decay 
ravaged the Y chromosome, and only three per cent of its ancestral genes survived. We reconstructed the evolution of the 
Y chromosome across eight mammals to identify biases in gene content and the selective pressures that preserved the 
surviving ancestral genes. Our findings indicate that survival was nonrandom, and in two cases, convergent across 
placental and marsupial mammals. We conclude that the gene content of the Y chromosome became specialized through 
selection to maintain the ancestral dosage of homologous X-Y gene pairs that function as broadly expressed regulators of 
transcription, translation and protein stability. We propose that beyond its roles in testis determination and spermato- 
genesis, the Y chromosome is essential for male viability, and has unappreciated roles in Turner’s syndrome and in phe- 


notypic differences between the sexes in health and disease. 


The human X and Y chromosomes evolved from autosomes over the 
past 300 million years’. Only 3% of ancestral genes survive on the human 
Y chromosome””’, compared to 98% on the X chromosome’. Y-chromosome 
decay was initially rapid but has virtually halted over the last 25 million 
years, leaving a stable set of ancestral genes’. Mathematical models of 
Y-chromosome decay assume all ancestral genes are equally likely to 
survive. However, our initial studies of the human Y chromosome sug- 
gested that its gene content is functionally coherent’, leading us to ask 
whether mammalian Y chromosomes preferentially retained a subset 
of ancestral genes, and, ifso, what qualities these surviving genes share. 

Our earlier analyses® of the human Y chromosome were hampered 
by limited knowledge of the gene content of the ancestral autosomes. 
Our recent cross-species comparisons enabled us to reconstruct their gene 
content and identify acquired genes on the X and Y chromosomes. The 
human X chromosome acquired and amplified testis-expressed gene 
families™*. Similarly, our comparisons of the human, chimpanzee and 
rhesus Y chromosomes indicated recent acquisition and amplification 
of testis-specific genes***. Thus, both the human X and Y chromosomes 
gained a specialization for male reproduction by acquiring genes that 
were not present on the ancestral autosomes”~*. 

We excluded acquired genes to independently examine ancestral Y- 
linked genes for characteristics that distinguished surviving genes from 
genes lost to decay. Because the human, chimpanzee and rhesus Y chro- 
mosomes share nearly identical ancestral gene content, we analysed five 
additional mammals to enhance our ability to detect biases in the decay 
and survival of ancestral genes. We produced finished sequence of the 
ancestral portions of the Y chromosomes of marmoset (Callithrix jac- 
chus), mouse (Mus musculus), rat (Rattus norvegicus), bull (Bos taurus) 
and opossum (Monodelphis domestica) and compared them to the pub- 
lished sequences of the human, chimpanzee (Pan trogolodytes) and rhe- 
sus macaque (Macaca mulatta) Y chromosomes, all eight corresponding 
X chromosomes and the orthologous chicken (Gallus gallus) autosomes 
as an outgroup to mammalian X and Y chromosomes. Using this ex- 
panded tree of species, we reconstructed the evolution of mammalian 


Y chromosomes from their origin to the present. We concluded that 
surviving Y-linked genes form a functionally coherent group enriched 
for dosage-sensitive, broadly expressed regulators of transcription, trans- 
lation and protein stability. 

We produced finished sequence using the SHIMS (single-haplotype 
iterative mapping and sequencing) strategy we employed on primate Y, 
human X and chicken Z chromosomes (Methods). These sequences 
comprise 17 megabases (Mb) and are accurate to about 1 nucleotide per 
0.3 Mb (Supplementary Table 1, Extended Data Fig. 1 and Methods). 
To identify ancestral X-Y gene pairs, we searched for Y-homologues 
of protein-coding genes we had identified as ancestral (Supplementary 
Tables 2 and 3)*°. We validated each putative gene by verifying tran- 
scriptional activity (Extended Data Fig. 2) and comparing its open read- 
ing frame to its chicken orthologue (Supplementary Data 1 and 2). We 
identified 36 different ancestral X-Y gene pairs across all eight species, 
adding 18 ancestral X-Y gene pairs to the 18 known to be present on 
the human, chimpanzee and rhesus Y chromosomes (Fig. 1). 


Regulatory functions of X-Y gene pairs 


Seventeen years ago, we characterized human X-Y gene pairs as spe- 
cialized in cellular housekeeping functions®. Since then, annotation of 
the human genome has increased in detail and completeness. We there- 
fore revisited the question of functional coherence and found evidence 
that X-Y pair genes perform an array of regulatory functions (Fig. 2). 
Based on annotations of their X homologues, ancestral Y-linked genes 
appear to regulate each stage of the central dogma: histone lysine demeth- 
ylases KDM5D (H3K4) and UTY (H3K27); the transcription factor ZFY, 
regulating stem-cell self-renewal; spliceosomal component RBMY; trans- 
lation initiation factors DDX3Y and EIF1AY; and the deubiquitinase 
USP9Y (Fig. 2). Compared to other ancestral genes that survive on the 
X chromosome, X-Y pair genes are enriched for annotations such as 
nucleic-acid binding, transcription and translation (Extended Data Table 1, 
Methods and Supplementary Table 4), suggesting that X-Y pair genes 
can govern expression of targets throughout the genome. 
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University School of Medicine, St. Louis, Missouri 63108, USA. 3Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA. 
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Figure 1 | Ancestral Y-linked genes by species and human X homologue 
location. Ancestral Y-linked genes (filled circles) and pseudogenes (open 
circles) listed by the position of their X-linked homologue on the human X 
chromosome. The placental-specific added region (red bar) and the conserved 
region shared with marsupials (blue bar) of the sex chromosomes are indicated 
on the left. Human sex chromosome evolution was punctuated by formation 
of at least 4 evolutionary strata (light blue, green, yellow and orange); other 
strata formed independently in opossum (purple) and marmoset (red). Myr, 
million years. 


Convergent survival of X-Y gene pairs 


To gain insight into the decay and survival of ancestral genes, we recon- 
structed Y chromosome evolution, taking advantage of our earlier dis- 
covery that synonymous nucleotide divergence between the X and Y 
sequences of each gene pair increases in stepwise fashion along the human 
X chromosome’””. This suggested a series of discrete events, most likely 
inversions on the Y chromosome, that suppressed X-Y crossing over 
in a single region, or ‘stratum,’ without disturbing gene order on the X 
chromosome’”. We used the 36 X-Y gene pairs to recalibrate previous 
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Figure 2 | Regulatory annotations of X-Y pair genes. Venn diagram 
depicting regulatory functions predicted for selected X-Y pair genes on basis of 
UniProt annotations of human X-homologue. Common alternatives to official 
gene symbols in parentheses. 


reconstructions of evolutionary strata (Extended Data Table 2, Extended 
Data Figs 3-5, Methods and Supplementary Tables 2 and 5). In broad 
agreement with previous reconstructions’ *”"°, we concluded that the 
human Xand Y chromosomes evolved from ordinary autosomes through 
chromosomal fusion and formation of at least four strata (Fig. 3 and 
Methods). 

Our results indicate that the stratum containing UBE1Y and KDM5D 
formed independently in the placental and marsupial lineages (Extended 
Data Fig. 4). The same set of ancestral genes became subject to genetic 
decay in each lineage, forming replicates of the same natural experi- 
ment. Out of the 184 ancestral genes shared between these strata, nine 
survived on the Y chromosome in marsupials, and three survived in 
placental mammals, but both lineages retained UBE1Y and KDM5D 
(Fig. 1, Supplementary Table 2 and Methods). The convergent survival 
of two ancestral genes is unlikely to occur under a model where genes 
survive genetic decay at random (one-tailed Fisher’s exact test, P< 
6.25 X 107°). 


Remarkable longevity of X-Y gene pairs 


Using these recalibrated evolutionary strata, we re-examined the kin- 
etics of genetic decay among ancestral Y-chromosome genes. Analysis 
of primate Y chromosomes had led us to conclude that, within a stra- 
tum, rapid gene loss was followed by stabilization at a baseline set of 
genes”. With five more divergent mammals, we doubled the constraints 
on the kinetics of gene loss during human Y chromosome evolution 
(Fig. 4 and Methods) and traced the stability of human Y-chromosome 
genes to the origin of mammals (Fig. 4). We infer that 97 million years 
ago, the Y chromosome of the common ancestor of placental mam- 
mals carried 18 ancestral genes from stratum 1 and stratum 2/3 (Fig. 1). 
Of those 18 genes, 14 survive in the human lineage (Fig. 1), and none 
have been lost in the last 44 million years (Fig. 4). We also examined 
whether ancestral Y-linked genes were stable in marsupials. Recent ana- 
lyses of the tammar wallaby (Macropus eugenii) Y chromosome iden- 
tified ten genes shared with the Tasmanian devil (Sarcophilus harrisii)"; 
we observe that all are ancestral and survive in the opossum. This sug- 
gests the opossum lineage maintained these genes over the last 78 mil- 
lion years’. We conclude that in both placental and marsupial lineages, 
some ancestral X-Y gene pairs were remarkably long lived despite rapid 
decay of surrounding genes. 


Two strategies preserved Y-linked genes 


In light of the regulatory annotations of X-Y gene pairs, convergent 
survival of X-Y gene pairs in the placental and marsupial lineages, and 
the longevity of ancestral X-Y gene pairs across mammals, we sought 
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Figure 3 | Reconstruction of human sex chromosome evolution. Major 
events in the evolution of the human sex chromsomes are labelled with 
approximate dates in Myr. After SRY evolved, at least 4 evolutionary strata 
(light blue, green, yellow and orange) formed in the lineage leading to the 
human Y chromosome. Each stratum expanded the MSY (male-specific 


the evolutionary pressures that drove their survival. We had previously 
speculated that biases in the gene content of the human Y chromosome 
could arise through two evolutionary strategies: retention and amp- 
lification of testis-specific gene families, and conservation of ancestral 
X-Y gene pairs to maintain comparable expression between males and 
females®. Using the set of 639 ancestral genes reconstructed through cross- 
species comparisons of the human X chromosome and orthologous chick- 
en autosomes”**, we tested whether these hypotheses account for the 
36 ancestral X-Y pair genes found on eight present-day Y chromosomes. 

The Y chromosome was predicted to accumulate genes that enhance 
male reproductive fitness’*, which depends upon sperm production in 
the adult testis. In each species we studied, ancestral genes that are ampli- 
fied into multi-copy families are expressed exclusively or predomi- 
nantly in the testis (Extended Data Fig. 2). However, many such genes 
have broadly expressed single-copy homologues on orthologous chicken 
autosomes, on mammalian X chromosomes, and in cases like DDX3Y, 
EIFIAY, UBE1Y and ZFY, on other Y chromosomes (Extended Data 
Fig. 2 and Supplementary Table 2). This suggests that adoption of testis- 
specific function preceded gene amplification. 

In light of evidence that intrachromosomal gene conversion preserved 
testis-specific gene families in primate Y-chromosome palindromes”, 
we speculated that gene amplification contributed to longevity. We ranked 
surviving Y-linked genes by total branch length across our tree of eight 
species (Fig. 5a)'”. Genes that are amplified in at least one species have 
a significantly greater branch length than those that are single copy 
in every species (one-tailed Mann-Whitney U-test, P< 4.27 X 10°) 
(Fig. 5a). This correlation remains robust when the opossum lineage, 
with a large number of unique single-copy genes, is excluded (one-tailed 
Mann-Whitney U-test, P< 5.54 x 10 *). Gene families in tandem 
arrays show high intraspecies identity and interspecies divergence, a 
sign that gene conversion is more frequent than mutation in these struc- 
tures (Extended Data Fig. 6). Two pairs of Y-linked genes, RPS4Y1 
and RPS4Y2 in primates and Zfy1 and Zfy2 in mouse, are exceptions. 
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region of the Y, deep blue) at the expense of the PAR (pseudoautosomal 
region, grey). Genetic decay eliminated most genes from MSY. A chromosomal 
fusion extended the PAR, generating conserved (XCR/YCR) and added 
(XAR/YAR) regions. 


Both are physically dispersed and show no sign of recent Y-Y gene 
conversion (Extended Data Fig. 6). We conclude that genes specialized 
for male reproduction avoided genetic decay through intrachromoso- 
mal gene conversion among members of a Y-linked, multi-copy gene 
family. 

Next, we examined whether single-copy genes on the Y chromo- 
some survived owing to selection to preserve the correct dosage of broadly 
expressed genes critical to both sexes**"°. Most genes on the Y chromo- 
some were lost to genetic decay, and the X chromosome evolved mech- 
anisms to compensate for the lost dosage of Y-linked genes in males*'*”. 
The Y chromosome might preferentially retain genes for which the tran- 
sition state of this process, with a non-functional Y-linked gene and 
a functional but non-dosage-compensated X-linked homologue, was 
disadvantageous. Dosage-sensitive genes functioning in many Uissne 
and cell types might be particularly sensitive to these pressures’*. We 
re-analysed published data sets for evidence that our set of 36 X-Y pair 
genes systematically differ from the 603 other ancestral genes on the X 
chromosome with regard to dosage sensitivity’*”°, breadth of expression” 
and intensity of purifying selection”. 

We examined whether X-Y pair genes show signs of dosage sens- 
itivity. In humans, gene-by-gene estimates predict a greater likelihood 
of haploinsufficiency’® for ancestral X-linked genes with surviving Y 
homologues compared to those lacking Y homologues (one-tailed Mann-— 
Whitney U-test, P< 6.59 X 10° *) (Fig. 5b). If surviving X-Y gene pairs 
maintain ancestral gene dosage, then X-linked genes with surviving 
Y-linked homologues should escape X inactivation. In human”’, mouse”, 
and opossum”, data on allele-specific expression in females is inform- 
ative for a subset of ancestral genes (Supplementary Table 2). In each 
species, a higher proportion of X-linked genes with surviving Y-linked 
homologues escape X-inactivation compared to those without surviv- 
ing Y-linked homologues (Supplementary Table 2), and X-Y gene pairs 
in which the X-homologue is subject to X-inactivation have Y-homologues 
that show signs of functional differentiation. In humans, 12 of 14 informative 
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Figure 4 | Decay of Y-linked genes to a baseline level. Gene numbers (on a 
log scale on the y axis) plotted versus time (in Myr before present (Myr BP) on 
the x axis). Filled circles show inferred or observed gene numbers in (from 
left to right) Ancestral X-Y genes (before stratum formation), the MSY of 
common ancestor of human and opossum (176 Myr Bp), bull (97 Myr Bp), 
mouse and rat (91 Myr Bp), marmoset (44 Myr Bp), rhesus (30 Myr Bp) and 
chimpanzee (6 Myr Bp), and modern human MSY. Lines represent best-fit 
curves to data points using alternate models of decay. Exponential decay to a 
constant baseline provides the best fit; shaded regions represent parameters 
producing an equally good fit. 


X-Y pair genes escape X inactivation, but only 168 of 385 remaining 
ancestral X genes escape (one-tailed Fisher’s exact test, P< 1.89 X 10 °) 
(Supplementary Table 2). The two exceptions, TSPY and RBMY, are 
amplified into testis-specific gene families (Extended Data Figs 2 and 6). 
In mouse, in which X chromosome inactivation is more complete”, 
four of nine informative X-Y pair genes escape X inactivation, whereas 
only five of 344 remaining ancestral genes escape (one-tailed Fisher’s 
exact test, P< 2.36 X 10°) (Supplementary Table 2). All five exceptions 
(Sry, Romy, Ubely, Usp9y and Zfy) evolved testis-specific expression 
in mouse (Extended Data Fig. 2). Despite differences in the mechan- 
isms of X inactivation between placental and marsupial mammals, all 
eight informative opossum X-Y pair genes escape X inactivation, but 
only 15 of 138 remaining ancestral genes escape (one-tailed Fisher’s exact 
test, P< 1.17X 10”) (Supplementary Table 2). 

The Turner’s syndrome phenotype (classically associated with a 45, 
X karyotype, or monosomy X) suggests a strict dosage requirement for 
one or more sex-linked genes in humans. If dosage of X-Y pair genes is 
partially responsible for the Turner’s syndrome phenotype, it could explain 
the differing features of monosomy X in humans and mice. Monosomy 
X in humans results in poor in utero viability. Less than 1 in 100 45,X 
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Figure 5 | Factors in the survival of Y-linked genes. Violin plots, white bar, 
interquartile range; circle, median value; asterisk, significant difference in one- 
tailed Mann-Whitney U-test. a, Multi-copy genes (n = 9) have greater 
longevity than single-copy genes (n = 27) (P< 4.28 X 10° °). b, X-Y pair genes 
(n = 32) have higher haploinsufficiency probability than other ancestral 

X genes (n = 478) (P< 6.59 X 10 *).¢, X-Y pair genes (n = 28) have broader 
expression across human tissues than other ancestral X genes (n = 383) 

(P< 2.20 X 10). d, X-Y pair genes (n = 27) have lower dy/ds ratio than other 
ancestral X genes (n = 489) (P< 3.39 X 10 *). 


conceptuses survive to term”*”*. Those that do survive are often mosaic 
for all or part of a second sex chromosome”, so that variability in the 
Turner’s syndrome phenotype may reflect variability in dosage of X-Y 
pair genes among tissues as well as individuals. The mouse phenotype 
of monosomy X is less severe; animals are small but viable and have 
reduced fertility" °°. This milder phenotype may reflect a dearth of genes 
on the mouse X chromosome that require two doses: only nine ances- 
tral genes survive on the mouse Y chromosome (compared to 17 in human), 
and fewer X-linked genes escape inactivation. 

Finally, human X-linked intellectual disability syndromes provide 
evidence for the dosage sensitivity of specific X-Y pair genes. UTX (also 
known as KMD6A), KDM5C and NLGN4X all have Y homologues, escape 
X inactivation, and appear to be haploinsufficient (Supplementary Table 2). 
Mutations in UTX cause Kabuki syndrome; both duplications and dele- 
tions result in multiple congenital anomalies and intellectual disability 
in males and females’. KDM5C is associated with X-linked intellec- 
tual disability in hemizygous males, and heterozygous females with mild 
intellectual disability have been reported in several families*’. In both 
hemizygous males and heterozygous females, NLGN4X mutations are 
associated with autism spectrum disorders and learning disabilities 
reminiscent of the cognitive and behavioural phenotypes of Turner’s 
syndrome’*’. Even the human X-homologues of X-Y gene pairs iden- 
tified only in the opossum (HCFC1, HUWE]1 and MECP2) still display 
acute sensitivity to gene dosage. In humans each of these X-linked genes 
has no Y homologue and is subject to X inactivation”? (Supplementary 
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Table 2). Nevertheless, a non-coding mutation causing overexpression 
of HCFC1, as well as duplications of HUWE1 and MECP2, have been 
implicated in X-linked intellectual disability in human males****. Thus, 
even though the human Y-homologues of HCFC1, HUWEI and MECP2 
were lost and the surviving X-homologues have evolved dosage com- 
pensation, their gene dosage remains tightly constrained. 

X-Y pair genes functioning across many tissues and cell types may 
face additional selective constraints that prevent both loss of the Y- 
linked gene and evolution of a dosage-compensated gene on the X 
chromosome. In all eight species, single-copy Y-linked genes are broadly 
expressed across adult tissues (Extended Data Fig. 2), with two major 
exceptions, in which both members of the X-Y pair share ancestrally 
restricted expression. AMELY, whose orthologue disappeared in the 
toothless avian lineage, is expressed only in developing tooth buds”; 
and HSFY, which is testis specific, and has a chicken orthologue that is 
predominantly expressed in testis. In chicken, the autosomal ortho- 
logues of mammalian X-Y pair genes have significantly broader expres- 
sion across adult tissues than do the orthologues of ancestral genes that 
survive only on the X chromosome, and X-Y pair genes maintain this 
broader expression across mammals (one-tailed Mann-Whitney U-test, 
chicken P < 3.38 X 10°; human P< 2.20 X 10 *; rhesus P< 1.39 X 
10’; mouse P< 4.74 X 10° *; rat P< 4.63 X 10%; bull P< 1.20 x 
10” °) (Fig. 5c and Supplementary Table 2). This breadth of expression 
extends to the earliest stages of development. Relative to other X-linked 
ancestral genes on the X chromosome, X-Y pair genes are enriched for 
genes upregulated after the onset of zygotic gene activation in a time 
course of human, mouse and bovine pre-implantation development 
(one-tailed Fisher’s exact test, human P < 2.13 X 10 *;mouse P < 5.93 X 
10° *; bull P< 1.37 X 10 *) (Supplementary Table 2). X-Y pair genes 
are more broadly expressed than other ancestral genes that survive on 
the X chromosome, across many tissues and developmental time. 

Unlike the testis-expressed multi-copy gene families, the broadly ex- 
pressed, dosage-sensitive single-copy genes of the Y chromosome can- 
not avoid genetic decay through intrachromosomal gene conversion, 
and must rely on purifying selection. Our previous survey of human 
sequence variation among the single-copy genes on the Y chromo- 
some showed that natural selection operated effectively to preserve the 
amino acid sequences of Y-linked genes in the human lineage”*. If X-Y 
gene pairs are haploinsufficient, alleles that alter the function of the 
X-linked homologues should be detrimental in both males and females. 
We examined Ensembl human-mouse orthologue alignment data for 
evidence that the X-linked homologues of X-Y gene pairs were subject 
to strong purifying selection”’. Relative to other ancestral genes on the 
X chromosome, the X-linked homologues of X-Y gene pairs have a 
reduced ratio of non-synonymous to synonymous substitution rates 
(dN/dS) (one-tailed Mann-Whitney U-test, P< 3.39 X 10 *) (Fig. 5d). 
We conclude that these broadly expressed, dosage-sensitive X-Y pair 
genes are under more intense purifying selection than their neighbours 
on the X chromosome. 


Human Y genes ensure male viability 


We conclude that the longevity of many Y-linked genes is due to selec- 
tion to maintain expression, in males, of dosage-sensitive, broadly ex- 
pressed X-Y gene pairs at levels comparable to their autosomal ancestors. 
This model predicts that members of surviving single-copy X-Y gene 
pairs should be functionally interchangeable. Indeed, the human Y-linked 
genes RPS4Y1 and DDX3Y are functionally interchangeable with their 
X homologues in vitro”, and although the histone demethylase domain 
of the mouse Y-linked gene Uty appears to be inactive, mouse Utx and 
Uty are functionally redundant during mouse embryonic development. 

Previous observations suggest that the selective pressures that main- 
tained these Y-linked genes remain strong in the human lineage; about 
99% of human 45,X conceptuses are inviable, and those that survive to 
term are often mosaic for all or part of a second sex chromosome””’. 
Therefore, we also conclude that the broadly expressed, dosage-sensitive 
genes of the human Y chromosome—along with their X-homologues, 
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which escape X chromosome inactivation—are collectively haplolethal. 
We propose that, as a set, these dozen Y-linked genes are essential for 
the viability of 46,XY fetuses (Methods and Extended Data Fig. 7). Thus 
we propose a third organismal function of the human Y chromosome: 
that it carries single-copy genes that ensure male viability. This is dis- 
tinct from the human Y chromosome’s more widely appreciated roles in 
testis determination through SRY and sperm production through amp- 
liconic gene families. 


Sex differences in health and disease 


All of the myriad differences between human males and females—from 
anatomy to disease susceptibility—arise from differences in the genes 
of the X and Y chromosomes that appeared as these chromosomes 
diverged in gene content from their autosomal ancestors. Of the 17 
surviving ancestral genes on the human Y chromosome, four (SRY, 
RBMY, TSPY, and HSFY) have clearly diverged in function from their 
X homologues (SOX3, RBMX, TSPX and HSFX) to play male-specific 
roles in reproductive development or gametogenesis. Because all genes 
on the Y chromosome were exposed to selection only in males, even 
widely expressed ancestral genes may exhibit subtle functional differ- 
ences from their X-linked homologues. Particularly worthy of consid- 
eration are eight global regulators of gene activity that exist as X-encoded 
and Y-encoded (male-specific) protein isoforms in diverse human tis- 
sues: UTX/UTY, EIF1AX/EIFIAY, ZFX/ZFY, RPS4X/RPS4Y1, KDM5C/ 
KDM5D, DDX3X/DDX3Y, USP9X/USP9Y and TBL1X/TBLIY. These 
exemplify a fundamental sexual dimorphism, at a biochemical level, 
throughout the human body, that derives directly from genetic differ- 
ences between the X and Y chromosomes. It will surely be of interest to 
determine whether this dimorphism has a role in diseases, outside the 
reproductive tract, that occur with greater frequency or severity in males 
or females. 


METHODS SUMMARY 


We used the SHIMS (single-haplotype iterative mapping and sequencing) strat- 
egy to assemble a path of sequenced clones for each organism (Methods). Contigs 
were ordered and oriented by radiation hybrid mapping using RHMAPPER 1.22 
(ref. 44) and extended metaphase and interphase FISH, as previously described*’. 

We validated transcription of predicted genes by reverse-transcriptase poly- 
merase chain reaction and capillary sequencing, as well as 454 sequencing of testis 
complementary DNA (cDNA), as previously described’. 

We relied on Ensembl’ version 70 to identify 1:1 orthologues between human, 
chimpanzee, rhesus, marmoset, mouse, rat and bull X chromosomes, the opossum 
X chromosomes and autosomes, and chicken autosomes, but manually reviewed 
cases where simple 1:1 orthologues were not clear**. Within each stratum, we iden- 
tified X-Y pair genes as ancestral if their X-linked or autosomal orthologues were 
syntenic in an outgroup. 

For each species, we aligned each X-Y pair and calculated dy, ds, and dy/ds 
using PAML* to identify evolutionary strata. For cross-species phylogenetic ana- 
lysis, we generated multiple alignments in MUSCLE” and used these alignments 
to generate a tree with 100 bootstrap replicates using DNAML in PHYLIP®. Within 
each stratum, we modelled gene loss as previously described®. 

To calculate longevity, we summed all branch lengths in the most parsimo- 
nious tree from each of the species where a gene is present to the last common an- 
cestor before stratum formation. 

We mapped published functional annotation data'*”* onto our set of ancestral 
genes. We identified pre-implantation expressed genes as previously described”. For 
expression breadth”’, we normalized the expression of each X-linked gene to the high- 
est reads per kilobase per million reads (RPKM) in any tissue, and took the average 
expression across all tissues. We used PANTHER” to calculate the enrichment 
of Gene Ontology terms in X-Y gene pairs relative to the ancestral X, and used 
UniProt annotations” to identify X-Y pair genes involved in regulatory processes. 
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A quantum computer can solve hard problems, such as prime factoring”, 
database searching** and quantum simulation’, at the cost of need- 
ing to protect fragile quantum states from error. Quantum error 
correction® provides this protection by distributing a logical state 
among many physical quantum bits (qubits) by means of quantum 
entanglement. Superconductivity is a useful phenomenon in this 
regard, because it allows the construction of large quantum circuits 
and is compatible with microfabrication. For superconducting qubits, 
the surface code approach to quantum computing’ is a natural choice 
for error correction, because it uses only nearest-neighbour coupling 
and rapidly cycled entangling gates. The gate fidelity requirements 
are modest: the per-step fidelity threshold is only about 99 per cent. 
Here we demonstrate a universal set of logic gates in a superconduc- 
ting multi-qubit processor, achieving an average single-qubit gate 
fidelity of 99.92 per cent and a two-qubit gate fidelity of up to 99.4 
per cent. This places Josephson quantum computing at the fault- 
tolerance threshold for surface code error correction. Our quantum 
processor is a first step towards the surface code, using five qubits 
arranged ina linear array with nearest-neighbour coupling. As a fur- 
ther demonstration, we construct a five-qubit Greenberger-Horne- 
Zeilinger state*’ using the complete circuit and full set of gates. The 
results demonstrate that Josephson quantum computing is a high- 
fidelity technology, with a clear path to scaling up to large-scale, fault- 
tolerant quantum circuits. 

Fault tolerance in the surface code is achieved by placing physical 
qubits in a chequerboard pattern, with white squares representing data 
and black squares representing measurement qubits that detect errors. 
To perform this detection, each measurement qubit needs to interact 
with its four neighbouring data qubits. All that is needed for these inter- 
actions are single- and two-qubit gates with sufficiently high fidelity. 
The high fidelity demonstrated here is achieved through a combination 
of coherent qubits, a straightforward interconnection architecture and 
a novel implementation of the two-qubit controlled-phase entangling 
gate. The controlled-phase gate uses a fast but adiabatic qubit frequency 
tuning that minimizes error’. 

Here the tuneable nature of the qubits and their entangling gates pro- 
vides both high fidelity and fast control. Previous demonstrations of 
two-qubit gates achieving >99% fidelity used fixed-frequency qubits: 
systems based on nuclear magnetic resonance and ion traps have shown 
two-qubit gates with fidelities of 99.5% (ref. 11) and 99.3% (ref. 12). 
Recently, for a five-qubit ion trap'’ and a three-qubit superconducting 
system“, two-qubit entangling gate fidelities of 95% and 96% were reported. 

Superconductivity allows for the construction of large quantum inte- 
grated circuits as the electrons are condensed into a single macroscopic 
quantum state. We have designed a processor to test our ability to imple- 
ment the surface code; it consists of five cross-shaped transmon qubits 
(Xmons) with nearest-neighbour coupling, arranged in a linear array 
(Fig. 1). The Xmon qubit"’ offers a nodal approach to connectivity while 


maintaining a high level of coherence (see Supplementary Information 
for decoherence times). Here the four legs of the cross allow for a natural 
segmentation of the design into coupling, control and readout. We choose 
a modest inter-qubit capacitive coupling strength of g/2n = 30 MHz 
and use alternating qubit idle frequencies of 5.5 and 4.7 GHz, enabling 
a controlled-phase gate in 40 ns when two qubits are brought near res- 
onance, while minimizing the effective coupling to 0.3 MHz when the 
qubits are at their idle points. Rotations around the X and Y axes in the 
Bloch sphere representation are performed using pulses on the micro- 
wave (XY) line, whereas Z-axis rotations, which control the phase of the 
quantum state, are achieved by a flux-bias current on the frequency- 
control (Z) line. We use a dispersive measurement method'® whereby 
each qubit is coupled to a readout resonator with a distinct resonance 
frequency, enabling simultaneous readout using frequency-domain mul- 
tiplexing through a single coplanar waveguide’’. The modularity of this 
architecture makes it straightforward to integrate more qubits in the circuit. 

We characterize our gate fidelities using Clifford-based randomized 
benchmarking"’’*””. The Clifford group is a set of rotations that evenly 
samples the Hilbert space, and the benchmarking thus averages across 
errors. For the single-qubit case, the Clifford gates (which we hence- 
forth refer to simply as Cliffords) comprise 7, 1/2 and 27/3 rotations 
(Supplementary Information). In randomized benchmarking, a logic 
gate is characterized by measuring its performance when it is inter- 
leaved with many random sequences of gates, making the measured 
fidelity resilient to state preparation and measurement errors. We first 
perform a control experiment on a ground-state qubit by generating a 
random sequence of m Cliffords; appending the unique recovery Clif- 
ford (C,) that inverts the sequence; and averaging the experimental 
sequence fidelity, the final ground-state population, over k different 
sequences'””°. The resulting reference sequence fidelity, F,g is fitted to 
Free = Apref” + B, where preg is the sequence decay, and state prepara- 
tion and measurement errors are captured in the parameters A and B. 
The average error per Clifford of the reference is given by reg = (1 — 
Pret)\(d — 1)/d, with d= 2N»s, We then measure the fidelity of a spe- 
cific gate by interleaving it with m random Cliffords. The sequence decay, 
Peate gives the gate error, gate = (1 — Peate/Pret)(d — 1)/d. 

The benchmarking results for the single-qubit gates are shown in 
Fig. 2. We generate the Cliffords using microwave pulses, from a basis 
set of m and n/2 rotations around the X and Y axes (Supplementary 
Information). We benchmark X- and Y-axis nm and 1/2 rotations, the 
Hadamard gate (implemented with a 1/2 Y rotation (Y/2) followed by 
a 1 X rotation (X)) and Z-axis rotations. From the data in Fig. 2, we 
extract the individual gate fidelities listed in the key. We find an average 
fidelity of 99.92% over all gates and qubits (Supplementary Information). 
The highest fidelities are achieved by optimizing the pulse amplitude and 
frequency, and minimizing two-state leakage’’. 

We have also measured the performance when simultaneously oper- 
ating nearest-neighbour or next-nearest-neighbour qubits”, with the 
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Figure 1 | Architecture. a, Optical image of the integrated Josephson 
quantum processor, consisting of aluminium (dark) on sapphire (light). The 
five cross-shaped devices (Qo-Q,) are the Xmon variant of the transmon 
qubits*’, placed in a linear array. To the left of the qubits are five meandering 
coplanar waveguide resonators used for individual state readout. Control 
wiring is brought in from the contact pads at the edge of the chip, ending at 
the right of the qubits. b, Circuit diagram. Our architecture uses direct, 
nearest-neighbour coupling of the qubits (red/orange), made possible by the 
nodal connectivity of the Xmon qubit. Using a single readout line, each qubit 
can be measured using frequency-domain multiplexing (blue). Individual 
qubits are driven through capacitively coupled microwave control lines (XY), 
and frequency control is achieved through inductively coupled d.c. lines (Z) 
(violet). c, Schematic representation of an entangling operation using a 
controlled-phase gate with unitary representation Ucz; (I) qubits at rest, at 
distinct frequencies with minimal interaction; (II) when brought near 
resonance, the state-dependent frequency shift brings about a rotation 
conditional on the qubit states; (III) qubits are returned to their rest frequency. 


Frequency 


qubits at dissimilar idle frequencies to minimize coupling. The fideli- 
ties are essentially unchanged, with small added errors of < 10 * 
(Supplementary Information), showing a high degree of addressability 
for this architecture. 

The two-qubit controlled-phase gate is implemented by tuning one 
qubit in frequency along a ‘fast adiabatic trajectory”® that takes the two- 
qubit state | 11) close to the avoided level crossing with the state |02) (ref. 23), 
yielding a state-dependent relative phase shift (Fig. 3a). This implemen- 
tation is the natural choice for weakly anharmonic, frequency-tunable 
qubits, because the other computational states are left unchanged**”’. 
It is advantageous that the controlled-phase gate is adiabatic as well as 
fast. An adiabatic trajectory is easily optimized and allows for leakage 
into the non-computational state |02) to be exponentially suppressed 
with gate duration, because slower gates are less likely to lead to 
undesired transitions’®. Having a fast controlled-phase gate minimizes the 
accumulation of errors from decoherence and unwanted entanglement 
with other circuit elements, which is favourable for fault tolerance. 
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Figure 2 | Single-qubit randomized benchmarking. a, A reference 
experiment is performed by generating a sequence of m random Cliffords, 
which are inverted by the recovery Clifford, C,. A specific gate, H, is tested using 
a sequence that interleaves H with m random Cliffords. The difference between 
interleaved and reference decay gives the gate fidelity. b, Representative 

pulse sequence for a set of four Cliffords and their recovery, generated with 
and 1/2 rotations about X and Y, displaying both the real (I) and imaginary (Q) 
microwave pulse envelopes before up-conversion by quadrature mixing to 
the qubit frequency. c, Randomized benchmarking measurement for the set of 
single-qubit gates for qubit Q,, plotting reference and interleaved sequence 
fidelities as functions of the length, m; the fidelity for each value of m was 
measured for k = 40 different sequences. The fit to the reference set yields an 
average error per Clifford of r,.¢ = 0.0011, consistent with an average gate 
fidelity of 1 — rye¢/1.875 = 0.9994 (Supplementary Information). The dashed 
lines indicate the thresholds for exceeding gate fidelities of 0.998 and 0.999. The 
fidelities for the single-qubit gates are tabulated in the key. We find that all gates 
have a fidelity greater than 0.999. The error bars on the data points are the 
standard deviations from the mean. The uncertainty in gate fidelity is typically 
5 X 10°, determined by bootstrapping. 


The benchmarking results of the controlled-phase gate are shown in 
Fig. 3b. Similar to the single-qubit case, we generate sequences of two- 
qubit Cliffords to produce a reference curve, and then interleave the 
controlled-phase gate to extract the fidelity. An example pulse sequence 
for an m = 55 Clifford sequence is shown in Fig. 3c. We find a controlled- 
phase gate fidelity of up to 99.44 + 0.05%, consistent with the average 
error per Clifford (Supplementary Information). We find fidelities of 
between 99.0% and 99.44% for all four pairs of nearest-neighbour qubits 
(Supplementary Information). This is a clear demonstration of high- 
fidelity single- and two-qubit gates in a multi-qubit Josephson quantum 
processor. The two-qubit gate fidelity compares well with the highest 
values reported for other mature quantum systems: for nuclear mag- 
netic resonance and ion traps, entangling gate fidelities are as high as 
99.5% (ref. 11) and 99.3% (ref. 12). Importantly, we have verified by simu- 
lation that the experimentally obtained gate fidelities are at the threshold 
for surface code quantum error correction (Supplementary Information). 

We are optimistic that we can improve on these gate fidelities with 
modest effort. The controlled-phase gate fidelity is limited by three error 
mechanisms: decoherence (55% of the total error), control error (24%) 
and state leakage (21%) (Supplementary Information). Decoherence 
can be suppressed with enhanced materials and optimized fabrication”®”’. 
Imperfections in control arise primarily from reflections and stray induc- 
tances in wiring, and can be improved using conventional microwave 
techniques. Given the adiabatic nature of the controlled-phase gate, 
two-state leakage can be suppressed by slightly increasing the gate time’”. 

We showcase the modularity of this set of quantum logic gates by con- 
structing a maximally entangled Greenberger-Horne-Zeilinger (GHZ) 
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state across all five qubits in our processor (Fig. 4, top). The N-qubit 


GHZ state |GHZ) = (|0)®” +1) ®) /V2 is constructed with single- 
and two-qubit gates, using simultaneous control and readout of all 
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Figure 4 | Quantum state tomography and generation of the GHZ states. 
Top: respective real parts of the density matrix p for the N = 2 Bell state and the 
N= 3,4 and 5 GHZ states, measured by quantum state tomography. Ideal 
density matrix elements are transparent, with value 0.5 at the four corners. 
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Figure 3 | Controlled-phase gate physics and randomized benchmarking 
results. a, We use the |1g1,) and |0g2,) avoided level crossing to implement a 
high-fidelity controlled-phase gate, with the fast adiabatic tuning of qubit A 
giving a selective m phase change of |1g1,). The energy level diagram shows 
qubit A approaching and leaving the avoided level crossing from 

above (top, blue dashed line), following a fast (43 ns) yet effectively adiabatic 
trajectory (bottom, solid blue line). Unwanted state leakage from |11,) to 
|0g2,) (red dashed line) is minimized by adjusting the trajectory. 

b, Randomized benchmarking data (k = 100) of the controlled-phase gate (CZ) 
for the qubit pair Q, and Q;, using the two-qubit Clifford group, C, 
(Supplementary Information). Reference data are in black (r,.¢ = 0.0189); 
interleaved data are in blue (rc, +cz = 0.0244). Dashed lines indicate the 
thresholds for gate fidelities of 0.98 and 0.99. We find a controlled-phase gate 
fidelity of 0.9944 + 0.0005 (uncertainty from bootstrapping). c, Coherent 
microwave (XY) and frequency (Z) control of the quantum state while 
performing a complex two-qubit algorithm; the sequence contains more than 
500 gates, corresponding to the characteristic reference decay of m = 55, and 
is more than 7 js long. The rightmost panel shows an example Clifford 

from the iSWAP class, comprising single-qubit rotations and two controlled- 
phase gates (Supplementary Information). 


qubits. This algorithm is shown in Fig. 4 (bottom), the state is assem- 
bled by entangling one additional qubit at a time. The algorithm is highly 
sensitive to control error and decoherence on any computational ele- 
ment. We fully characterize the Bell and GHZ states by using quantum 
state tomography”, where quadratic maximum-likelihood estimation is 
used to extract each density matrix, p, from the measurement data, 
while satisfying the physical constraints that p be Hermitian, have unit 
trace and be positive semi-definite (Supplementary Information). The 
density matrices are plotted in the traditional cityscape style, and show 
significant elements only at the ideal locations. We find respective state 
fidelities of Tr(PideaP) = 99.5% + 0.4%, 96.0% + 0.5%, 86.3% + 0.5% 
and 81.7% + 0.5% for the N = 2 Bell state and the N = 3, 4 and 5 GHZ 
states. A GHZ state fidelity of more than 50% satisfies the criterion for 
genuine entanglement”. It is interesting to note that the ratios of off- 
diagonal to diagonal amplitudes Pio) 2 Jy 2 F / 242% yes Pj1y2® j1y@% 
have the values 0.99, 0.98, 0.99 and 0.99, suggesting that dephasing is 
small, uncorrelated or both. The five-qubit GHZ state is the largest to- 
mographic measurement of multi-qubit entanglement demonstrated 
so far in the solid state*”, and has a state fidelity similar to results obtained 
in ion traps”. This demonstrates that complex quantum states can be 
constructed with high fidelity in a modular fashion, highlighting the 
potential for more intricate algorithms on this multipurpose quantum 
processor. 


Y/2 


! 


Bottom: algorithm used to construct the states. See Supplementary Information 
for Im(p), the Pauli operator representation and the full gate sequence, which 
includes Hahn spin-echo pulses. 
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We have shown single- and two-qubit gates with fidelities at the fault- 
tolerant threshold for the surface code in an integrated circuit quantum 
processor. With this demonstration, Josephson quantum devices are 
now ready to explore fault-tolerant, multi-qubit computing. Extending 
the linear array of qubits to larger numbers of qubits is straightforward, 
and generating a two-dimensional grid of qubits seems to be reduced to 
the engineering challenge, albeit considerable, of adding more qubits, 
wiring and readout while maintaining coherence and gate fidelity (Sup- 
plementary Information). In a separate experiment, we have demon- 
strated qubit state measurement with 99% fidelity in 140 ns (ref. 17), 
with a design that can be seamlessly integrated with this architecture. 
The combination of high-fidelity logic, a multi-qubit architecture, and 
fast and accurate qubit readout provides the essential ingredients for a 
Josephson surface code quantum computer. 


METHODS SUMMARY 


We have achieved high-fidelity single-qubit gates with a series of automated exper- 
iments that minimize the error in frequency, amplitude and two-state leakage. The 
qubit frequency is measured by performing a Ramsey experiment. The amplitudes 
for m and 7/2 rotations are optimized by concatenating many gates to amplify small 
rotation errors. The two-state leakage is optimized by minimizing phase error accu- 
mulated through repeated X/2 and —X/2 gates; see ref. 21 for details. 

The controlled-phase waveform was optimized for phase shift and adiabaticity. 
We determine the conditional phase through a Ramsey experiment as in Fig. 1c. The 
qubit trajectory is optimized for adiabaticity when leakage from |1g1,) to |0g2,) is 
minimized. We quantify leakage by initializing the qubits to |1g1,), performing a 
controlled-phase gate and measuring leakage into |0p2,). 

Parameters for single- and two-qubit gates were fine-tuned or confirmed to be 
optimal by using Clifford-based randomized benchmarking. 

For randomized benchmarking, we generate k random sequences of Cliffords 
for each sequence length m. 
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Electroreduction of carbon monoxide to liquid fuel 
on oxide-derived nanocrystalline copper 


Christina W. Li!, Jim Ciston? & Matthew W. Kanan! 


The electrochemical conversion of CO, and H,O into liquid fuel 
is ideal for high-density renewable energy storage and could pro- 
vide an incentive for CO, capture. However, efficient electrocata- 
lysts for reducing CO, and its derivatives into a desirable fuel’* are 
not available at present. Although many catalysts*"' can reduce CO, 
to carbon monoxide (CO), liquid fuel synthesis requires that CO is re- 
duced further, using H,O asa H* source. Copper (Cu) is the only known 
material with an appreciable CO electroreduction activity, but in bulk 
form its efficiency and selectivity for liquid fuel are far too low for prac- 
tical use. In particular, H.O reduction to Hz outcompetes CO reduc- 
tion on Cu electrodes unless extreme overpotentials are applied, at which 
point gaseous hydrocarbons are the major CO reduction products’. 
Here we show that nanocrystalline Cu prepared from Cu,O (‘oxide- 
derived Cu’) produces multi-carbon oxygenates (ethanol, acetate and 
n-propanol) with up to 57% Faraday efficiency at modest potentials 
(-0.25 volts to -0.5 volts versus the reversible hydrogen electrode) in 
CO-saturated alkaline H,O. By comparison, when prepared by tra- 
ditional vapour condensation, Cu nanoparticles with an average crys- 
tallite size similar to that of oxide-derived copper produce nearly 
exclusive H, (96% Faraday efficiency) under identical conditions. 
Our results demonstrate the ability to change the intrinsic catalytic 
properties of Cu for this notoriously difficult reaction by growing 
interconnected nanocrystallites from the constrained environment 
of an oxide lattice. The selectivity for oxygenates, with ethanol as the 
major product, demonstrates the feasibility of a two-step conversion 
of CO; to liquid fuel that could be powered by renewable electricity. 

Cu electrodes have been thoroughly evaluated in both CO, (refs 7, 
14, 15) and CO reduction’*"*'*”” electrolyses. In CO-saturated aqueous 
solutions, polycrystalline Cu foil produces a mixture of compounds that 
are dominated by H, at low overpotential, by CO and HCO, at high 
overpotential and by hydrocarbons and multi-carbon oxygenates at the 
most extreme potentials'*’’. When supplied with CO in the absence of 
CO,, Cu produces hydrocarbons and multi-carbon oxygenates, but very 
negative potentials are still required to promote CO reduction over H, 
evolution’*”’. Large overpotentials preclude energetically efficient elec- 
trolysis and favour hydrocarbons over liquid oxygenates. We recently 
discovered that oxide-derived Cu (OD-Cu) has much higher selectivity 
for CO, electroreduction over H, evolution than does polycrystalline Cu 
(ref. 20). Because CO reduction activity may be obscured in the presence 
ofa large excess of CO2, we questioned whether OD-Cu would reduce CO 
with greater efficiency than other Cu materials when supplied with CO 
directly. Here we show that OD-Cu reduces CO with high Faraday 
efficiency at an overpotential >0.6 V lower than that of polycrystalline 
Cu foil. This improvement results both from an increase in the intrin- 
sic CO reduction activity and from a decrease in intrinsic HO reduc- 
tion activity on OD-Cu. The major CO reduction products are ethanol 
and acetate, which correspond to 8e and 4e” reductions with H,O as 
the H™ source (equations (1) and (2), where E is the equilibrium potential). 


2CO + 7H,0+ 8e —CH3CH,0H+ 8HO™ 
E=0.18 V versus RHE 


(1) 


2CO + 3H,0+4e —CH3CO, +3HO™ 
E=0.50 V versus RHE 


(2) 
By comparing OD-Cu to electrodes comprised of commercial Cu nano- 
particles, we show that CO reduction activity is not a consequence of 
nanocrystallite size or morphology; instead, we propose that grain boun- 
daries participate in the catalysis. 

OD-Cu electrodes were prepared by annealing polycrystalline Cu 
foil in air at 500 °C to grow a thick Cu,O layer on the surface and sub- 
sequently reducing this oxide to form Cu° nanocrystallites. We used 
two reduction methods to vary the properties of the material. To obtain 
OD-Cu 1, the Cu,O precursor was reduced electrochemically in aque- 
ous solution at ambient temperature. This procedure had previously been 
found to yield nanocrystalline Cu with maximal selectivity for CO, 
reduction”. To obtain OD-Cu 2, the Cu2O precursor was reduced with 
H, at 130 °C. The OD-Cu electrodes were compared to commercial Cu 
nanoparticles that had been synthesized by vapour condensation. Cu 
nanoparticle electrodes were prepared by drop-casting suspensions of 
the nanoparticles and fluorinated binder onto metal substrates. 

All three types of electrodes exhibited similar morphologies when 
imaged by scanning electron microscopy (SEM) (Fig. 1a, d, g). Particle 
sizes ranged from about 30 nm to 100 nm, and the particles formed aggre- 
gated films. Low-resolution transmission electron microscopy (TEM) 
confirmed the range of particle sizes observed via SEM (Fig. 1b, e, h). 
High-resolution TEM of OD-Cu 1 and OD-Cu 2 showed interconnected 
nanocrystalline networks with distinct grain boundaries between nano- 
crystallites (Fig. 1f, i). In contrast, the Cu nanoparticle electrode appeared 
as an aggregation of overlapping particles (Fig. 1c). Differential capaci- 
tance measurements yielded electrochemical roughness factors of 135, 
48 and 26 for OD-Cu 1, OD-Cu 2 and the Cu nanoparticle electrode, 
respectively (Extended Data Fig. 5 and Extended Data Table 2). X-ray 
photoelectron spectroscopy indicated that the surfaces were comprised 
of both Cu’ and Cu,0, the latter resulting from air exposure during the 
X-ray photoelectron spectroscopy sample preparation (Extended Data 
Figs 1, 2 and 3). Grazing incidence X-ray diffraction patterns for all of 
the electrodes exhibited peaks at the expected positions for an ideal Cu 
lattice, indicating no uniform expansion or compression of the unit cell 
(Fig. 1j, k, 1). The patterns differed substantially in their peak widths, 
however. Using the Scherrer equation, both OD-Cu 2 and the Cu nano- 
particle electrode exhibited average crystallite sizes (D) > 100 nm, while 
OD-Cu 1 had a smaller (D) of 31 nm. Additional high-resolution graz- 
ing incidence X-ray diffraction patterns were collected for OD-Cu 1 
and OD-Cu 2 with synchrotron radiation to determine the amount of 
microstrain in each sample using Williamson-Hall analysis. This ana- 
lysis yielded microstrain values of 0.2% and 0.05% for OD-Cu 1 and 
OD-Cu 2, respectively (Extended Data Fig. 4). 

CO electroreduction activity was measured under steady-state con- 
ditions by performing constant-potential electrolyses for multiple hours 
in 0.1 M KOH saturated with 1 atm of CO at ambient temperature. The 
concentration of CO in solution is only 1 mM under these conditions. 
Solution-phase and gas-phase products were quantified by nuclear magnetic 
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Cu nanoparticles 


Oxide-derived Cu 1 


Oxide-derived Cu 2 


Figure 1 | Physical characterization of Cu nanoparticle and OD-Cu 
electrodes. Top row, the Cu nanoparticle electrode. Middle row, the OD-Cu 1 
electrode. Bottom row, the OD-Cu 2 electrode. a, d, g, SEM images. 


resonance (NMR) spectroscopy and gas chromatography, respectively. 
Electrolyses were performed at a range of applied potentials E between 
-0.25 V and -0.5 V versus the reversible hydrogen electrode (RHE). 
The modest reducing potential and low CO concentration provides a 
stringent test ofa catalyst’s selectivity for CO versus H,O reduction. Under 
these conditions, polycrystalline Cu foil exhibited low geometric current 
densities and H, was the only detectable product. These results are con- 
sistent with previous results in KOH, in which much more negative 
potentials (-0.7 V versus RHE) and a high flow rate of CO through the 
solution were required to attain 22% total Faraday efficiency for CO 
reduction, with only 7% Faraday efficiency for oxygenates’*. A maxi- 
mum CO reduction Faraday efficiency of 65% was reported for poly- 
crystalline Cu foil at -0.9 V versus RHE in a mixed KCl and KHCO; 
electrolyte, although only 10% Faraday efficiency corresponded to the 
formation of oxygenates’”. 

Larger geometric current densities were obtained with OD-Cu and 
Cu nanoparticle electrodes as a result of their roughness factors, which 
facilitated product quantification and comparison of their selectivities 
(Fig. 2 and Extended Data Table 1). For the Cu nanoparticle electrode, 
the Faraday efficiency for H, was =94% at all potentials examined; the 
small remainder of the current corresponded to acetate, ethanol and 
ethylene formation. The high selectivity for H,O reduction on this elec- 
trode mimics that of a Cu foil electrode. In contrast, OD-Cu electrodes 
had a much greater propensity for CO reduction. OD-Cu 1 attained a 
total CO reduction Faraday efficiency of 57% at -0.3 V, whereas OD- 
Cu 2 attained 48% at -0.4 V. These values declined at more negative 
potentials, but this decline was simply the result of the catalysts reach- 
ing the mass-transport-limited current density for CO reduction (see 
below). For both materials, ethanol and acetate were the nearly exclusive 
CO reduction products at the more positive potentials, whereas ethyl- 
ene production emerged at more negative potentials. OD-Cu 2 also 
produced substantial amounts of propanol, which was only observed 
for OD-Cu 1 under higher CO pressure (see below). In addition to the 
reduction products, OD-Cu electrodes also produced small amounts of 
formate, the product of CO hydration. 
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100 120 140 
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b, e, h, Low-magnification TEM images. c, f, i, High-resolution TEM images. 
j,k 1, Grazing incidence X-ray diffraction patterns. 


The higher Faraday efficiency for CO reduction on OD-Cu electro- 
des compared to a Cu nanoparticle electrode reflected higher intrinsic 
CO reduction activity. Figure 2b and c shows the geometric and surface- 
area-normalized current densities for CO reduction (jcoredn) versus E 
for the three electrodes. The difference in intrinsic activity between two 
electrodes corresponds to the ratio of their normalized current densities 
in the absence of mass-transport contributions to the kinetics. The rate 
of CO transport from bulk solution through the diffusion layer to the 
electrode surface determines the maximum attainable geometric jcoredn 
(ref. 21). The Cu nanoparticle electrode did not reach a mass-transport 
limit between -0.3 V and -0.5 V, as shown by the exponential increase 
in jcoredn across this potential range. Its Tafel slope was 112 mV dec ' 
(where ‘dec’ means decade, or one order of magnitude). On account of 
their higher activity, the geometric jcoredn On OD-Cu electrodes quickly 
reached a plateau value of 0.5 mA cm” as the overpotential was increased 
(Fig. 2b), indicating that this value is approximately the mass transport 
limit with only 1 mM CO in solution. At -0.3 V, before the onset of the 
mass transport contributions, normalized jcorean Was 23-fold higher 
for OD-Cu 1 and 43-fold higher for OD-Cu 2 than for the Cu nano- 
particle electrode (Fig. 2c). To determine the dependence of jcoredn ON 
E for OD-Cu electrodes in the absence of mass-transport effects, addi- 
tional electrolyses were performed in the range -0.25 V > E> -0.35 V 
(Extended Data Fig. 8). A Tafel slope of approximately 113 mV dec™’ 
was obtained. 

The electrodes also differed in their intrinsic activities for HO reduc- 
tion. Figure 2d and e shows the normalized current density for H,O 
reduction (j}2) in CO-saturated and N,-saturated 0.1 M KOH, respec- 
tively. All three electrodes exhibited an HO reduction activity 2-fold 
to 5-fold lower in the presence of CO than in the presence of No, 
indicating some poisoning of the sites for HO reduction by CO binding”. 
In CO-saturated electrolyte, normalized jy. was similar for the Cu nano- 
particle electrode and OD-Cu 2, but was 2-fold to 6-fold lower for OD- 
Cu 1. The suppression of H,O reduction contributes to the higher Faraday 
efficiency for CO reduction on OD-Cu 1. In N,-saturated electrolyte, 
juz Was suppressed 4-8-fold on OD-Cu 1 and 2-6-fold on OD-Cu 2 
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Figure 2 | Comparison between OD-Cu 1, OD-Cu 2 and Cu nanoparticle 


electrodes in electrolyses performed in 0.1 M KOH saturated with 1 atm CO 
at ambient temperature. a, Faraday efficiencies for CO reduction products 
ethanol (EtOH), acetate (AcO ), n-propanol (n-PrOH), ethylene (C,H,) and 
ethane (C,H,) at selected potentials versus the reversible hydrogen electrode. 
The remaining Faraday efficiency in all cases corresponds to H2 evolution. The 
current density for the Cu nanoparticle electrode at -0.25 V was too low for 
product quantification. b, Geometric current densities for CO reduction. 

c, Surface-area-normalized current densities for CO reduction. d, Surface-area- 
normalized current density for H, evolution under 1 atm of CO. e, Surface- 
area-normalized current density for Hz evolution under 1 atm of No. 


compared to the Cu nanoparticle electrode. The similar Tafel slopes 
for H, evolution suggest that OD-Cu electrodes have a lower density of 
active sites for this reaction. 

To confirm that the geometric jcoyedn at 1 atm CO is limited by the 
kinetics of CO mass transport, OD-Cu 1 was evaluated in an electro- 
lysis performed in 0.1 M KOH saturated with 2.4 atm of CO. At poten- 
tials between -0.3 V and -0.5 V, geometric jcoredn Was 1.8-2.4-fold higher 
under these conditions (Fig. 3a). The Faraday efficiency for CO reduc- 
tion at E< -0.3 V was also substantially improved (Fig. 3b). These results 
indicate that practical current densities may be possible in this moderate 
overpotential regime by further increasing CO transport to the catalyst. 

The electrolysis data provide some insight into the mechanism of CO 
reduction on OD-Cu. The jcorean Tafel slope is consistent with a rate- 
limiting initial le” transfer to CO. The absence of C1 products indicates 
that C-C coupling is rapid once the process of CO reduction initiates” 
or that the initial e" transfer is coupled to C-C bond formation between 
a surface-bound CO anda CO from solution’’. The latter would account 
for increased jcoredn at increased pressure if the active sites of OD-Cu 
were saturated with CO at 1 atm (refs 24, 25). The formation of ace- 
tate probably arises from attack of HO” on a surface-bound ketene or 
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Figure 3 | Comparison of CO reduction in 0.1 M KOH saturated with 1 atm 
of CO versus 2.4 atm of CO. a, Geometric current densities for CO reduction. 
b, Faraday efficiencies for CO reduction products at selected potentials. 


other carbonyl-containing intermediate after C-C bond formation has 
occurred. This contention is supported by the observation of substantially 
increased acetate formation when the electrolysis is performed in 1 M 
KOH (Extended Data Fig. 9). 

The results above implicate the participation of grain boundary sur- 
faces in the CO reduction catalysis on OD-Cu. Although OD-Cu 1 and 
OD-Cu 2 differ in their bulk properties, both are distinguished from 
Cu nanoparticle electrodes by having a high density of grain bound- 
aries as a consequence of their formation from a solid phase. Given that 
grain boundaries can support surface structures that are not stable on 
an individual nanocrystal’®, the enhanced CO reduction activity on OD- 
Cu electrodes relative to Cu nanoparticle electrodes is consistent with 
the presence of highly active sites on their grain boundary surfaces. The 
difference in their product distribution could reflect differences in grain 
boundary structures resulting from electrochemical versus Hz reduc- 
tion of Cu,O. The results suggest that engineering the grain boundaries 
by altering the oxide reduction method or by using an alternative pre- 
paration of nanocrystalline materials could yield catalysts with higher 
activity and selectivity for long-chain products. The participation of grain 
boundary surfaces in catalysis has previously been proposed to explain 
correlations between grain boundary density and methanol electroox- 
idation activity with Pt nanoparticles” and PtRu alloy nanoparticles”. 

Catalytic processes have been optimized over the past century to 
reduce CO with H, to alkanes, methanol or higher alcohols. Although 
many of these processes are efficient, they all require high temperatures 
and pressures (typically 250-400 °C and 50-150 atm) and large-scale 
reactors that are poorly matched to diffuse renewable energy sources”. 
Moreover, H, is almost exclusively derived from fossil fuel sources at 
present. Electrolytic syntheses enabled by the catalysis described here 
could provide a more direct, versatile and energetically efficient route 
to distributed liquid fuel production powered by renewable energy. 
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METHODS SUMMARY 


CO reduction electrolyses were performed in CO-saturated 0.1 M KOH electrolyte 
in a two-compartment electrochemical cell. The headspace of the cathodic chamber 
was continuously purged with CO into the sampling loop of a gas chromatograph 
to enable periodic quantification of the gas-phase products. The solution-phase pro- 
ducts were quantified by NMR analysis of the electrolyte at the conclusion of the 
electrolyses. OD-Cu electrodes were prepared by electropolishing pieces of poly- 
crystalline Cu foil (99.9999%) in 85% phosphoric acid and subsequently annealing 
the electrodes in air at 500 °C for 1 h to form a thick Cu,O layer. The active OD-Cu 
1 layer was formed in situ at the beginning of CO reduction electrolysis, requiring 
10-12 Ccm” of charge to fully reduce the Cu,O material. OD-Cu 2 was formed by 
reduction of the CuO layer in flowing H, at 130°C for 2h. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Hybrid shallow on-axis and deep off-axis 
hydrothermal circulation at fast-spreading ridges 
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Hydrothermal flow at oceanic spreading centres accounts for about 
ten per cent of all heat flux in the oceans'” and controls the thermal 
structure of young oceanic plates. It also influences ocean and crus- 
tal chemistry, provides a basis for chemosynthetic ecosystems, and 
has formed massive sulphide ore deposits throughout Earth’s his- 
tory. Despite this, how and under what conditions heat is extracted, 
in particular from the lower crust, remains largely unclear. Here we 
present high-resolution, whole-crust, two- and three-dimensional 
simulations of hydrothermal flow beneath fast-spreading ridges that 
predict the existence of two interacting flow components, controlled by 
different physical mechanisms, that merge above the melt lens to feed 
ridge-centred vent sites. Shallow on-axis flow structures develop owing 
to the thermodynamic properties of water, whereas deeper off-axis 
flow is strongly shaped by crustal permeability, particularly the brittle- 
ductile transition. About 60 per cent of the discharging fluid mass 
is replenished on-axis by warm (up to 300 degrees Celsius) recharge 
flow surrounding the hot thermal plumes, and the remaining 40 per 
cent or so occurs as colder and broader recharge up to several kilo- 
metres away from the axis that feeds hot (500-700 degrees Celsius) 
deep-rooted off-axis flow towards the ridge. Despite its lower con- 
tribution to the total mass flux, this deep off-axis flow carries about 
70 per cent of the thermal energy released at the ridge axis. This 
combination of two flow components explains the seismically deter- 
mined thermal structure of the crust and reconciles previously incom- 
patible models favouring either shallower on-axis** or deeper off-axis 
hydrothermal circulation®*. 

The fast-spreading East Pacific Rise (EPR) at 9° N is one of the best- 
studied ridge sections worldwide. Multichannel seismic studies between 
9° N and 13° N image a nearly continuous sub-axial melt lens at depths 
of about 1.2-2.4 km (ref. 9). A tomographic study" at 9° 30’ N shows a 
narrow P-wave anomaly below the melt lens that is best modelled as a 
4-5-km-wide, high-temperature region extending through the whole 
crust and only widening at depths below the crust-mantle boundary 
(Fig. 1a). Isotherms are steep near this ‘hot slot’ but become nearly hori- 
zontal further off-axis, consistent with field observations from the Oman 
ophiolite”"'. Axis-perpendicular two-dimensional (2D) hydrothermal 
flow models'*’? show that this implies hydrothermal convection through 
the entire crust’, with hydrothermal fluids probably migrating parallel 
to the steep isotherms and thereby altering the lower gabbros at some 
distance away from the ridge axis'*. This is in contrast to the conclu- 
sion, drawn from microearthquake distributions’ at EPR 9°N, that 
hydrothermal cells develop predominantly above the melt lens—a con- 
cept apparently supported by vent fluid chemistry studies’® and pre- 
vious three-dimensional (3D) simulations of hydrothermal flow at 
fast-spreading ridges*””. 

To investigate how whole-crustal cooling can be reconciled with supra- 
melt lens fluid circulation, we developed a 3D model that resolves hydro- 
thermal flow throughout the entire thickness of the oceanic crust at a 
fast-spreading ridge. We obtain the permeability structure for this 3D 
model from 2D numerical experiments in which we simultaneously solve 
for crustal accretion processes and cooling by hydrothermal flow’®. This 


2D model predicts the thermal structure of the young crust and the 
depth of the melt lens for a given permeability field and a full spreading 
rate of 11 cmyr (see ref. 18 and Methods). Permeability decreases expo- 
nentially with depth and linearly with distance from the ridge axis and 
was varied until (1) the predicted crustal temperature field matches the 
one inferred from seismic tomography” and (2) the melt lens (corres- 
ponding to the 1,200 °C isotherm) is located at the observed depth? 
of 1,400 m. Key features of this parameterized permeability field (see 
Methods and Extended Data Fig. 1) are relatively high permeabilities 
adjacent to the margins of the hot slot, mimicking the effects of a thermal 
cracking front’. This is consistent with microcracks found in lower 
gabbros’ and in peridotites a few hundred metres below the crust-mantle 
boundary” of the Oman ophiolite. An additional decrease in permeab- 
ility beyond a distance to the ridge of 5 km is required to reproduce the 
seismically inferred nearly horizontal isotherms" in this region and to 
be consistent with the low permeabilities inferred from porosity logs 
from drill holes in more mature oceanic crust*!. The brittle-ductile trans- 
ition in the lower crust” is accounted for by decreasing permeability to 
10 18m? between 600 °C and 800°C, so that no significant fluid flow 
occurs through rock hotter than 800°C. The top of the model has a 
250-m-thick pillow basalt layer”? in which permeability increases to 
10 '* m/ at the sea floor. 

The 2D results (Fig. 1b) show the thermal evolution of the oceanic 
crust after 500,000 years of simulation time in response to the interplay 
of crustal accretion processes and hydrothermal cooling. The hydro- 
thermal flow patterns predicted by the 2D model (Fig. 1c) hint at two 
coexisting flow components: (1) on-axis convection above the melt lens 
with strong adjacent recharge flow and (2) much broader and deeper off- 
axis circulation. Both flow components merge above the melt lens and 
supply the axial discharge. 

Using a 3D hydrothermal flow model (see Methods) we constrain 
the relative importance of on- versus off-axis flow and explore the de- 
tailed flow structures developing over shorter timescales of a few thou- 
sand years. Whereas previous 3D*”” and across-axis 2D'*!*"* models 
had focused either on supra-melt lens or ridge-perpendicular flow, we 
use a whole-crust 3D finite-element model that resolves both flow com- 
ponents simultaneously. Fluid properties are determined from the pure 
water [APS-84 equation of state, so that the fluid is in the supercritical 
single-phase regime for the hydrostatic pressures prevailing at the sea- 
floor. The 1,000 °C isotherm derived from the 2D results at steady state 
is used as a constant-temperature lower boundary for the 3D domain, 
whereas the top boundary follows a smoothed-average bathymetry of 
the EPR at 9° N. Fluids are free to enter (at a temperature of 2 °C) or to 
leave the domain through the top boundary only. Sidewalls are imper- 
meable and insulating. This set-up results ina 16 km X 6 km X 6.5 km- 
sized computational domain that is meshed with about 25 million 
tetrahedral elements (Extended Data Fig. 2). Each 3D simulation ran 
on a cluster with 32 processors for about 3 weeks. 

Our 3D results show the formation of a single hybrid hydrothermal 
system (Fig. 1d, e). On-axis upflow organizes into a network of closely 
interconnected thermal plumes that feed high-temperature (up to 410 °C) 
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Figure 1 | Results of our 2D and 3D 
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axial vent fields with an along-axis spacing between 500 m and 1,000 m. 
Partial ‘on-axis’ recharge surrounds the thermal plumes and does not 
penetrate deeper than the depth of the melt lens (see Fig. le), consistent 
with earlier findings by ref. 4. In contrast, ‘off-axis’ recharge flow occurs 
over a broad area and feeds very hot (500 °C-700 °C) ridge-perpendicular 
flow upslope the flanks of the hot slot, showing no significant ridge- 
parallel motion until the melt lens depth is reached, where the fluid 
starts to focus into tubes with a spacing similar to that of the on-axis 
plumes (Fig. 1d, e). Near the top of the melt lens both flow components 
merge before feeding the axial vent fields, where fluids are released at a 
rate of 80-90kgs~' per kilometre ridge axis. A quantification of the 
fluid mass fluxes as a function of distance to the ridge (Fig. 2a) shows 
that, on average, about 60% of the entire recharge flow occurs on-axis 
(within 1.1 km of the ridge axis), while about 40% of the venting fluids 
originate from distances greater than 1.1 km from the ridge axis. This 
ratio also holds for simulations with along-ridge variations in melt lens 
depth (see Extended Data Figs 4 and 5). 


steady broad off-axis circulation; 
positive values correspond to upward 
fluid flow. d, Our 3D model resolves 
hydrothermal convection in a 6-km- 
long and 16-km-wide modelling 
domain above the 1,000 °C isotherm 
(black bottom surface) obtained 
from the 2D results. Thermal 
evolution and vent field locations are 
shown after 3,500 years of simulation 
time. e, Flow paths of fluid tracers, 
colour-coded for temperature in a 
subsection of the 3D domain. 


1 2 3 4 5 


Pathways of 
numerical fluid tracers 
prior to venting on-axis 


An analysis of vertical mass fluxes versus temperature (Fig. 2b and 
Extended Data Fig. 3) reveals that fundamentally different processes 
shape on-axis (red bars) and off-axis (blue bars) flow. On-axis, most 
upflow occurs between 300 °C and 450 °C, whereas downflow is stron- 
gest between 100°C and 200 °C. These preferred temperature ranges 
are controlled by the thermodynamic properties of water***. Upflow 
occurs at temperatures around 425 °C, at which (for the prevailing 
pressures) the ability of water to transport energy by buoyancy-driven 
flow (that is, the fluid’s ‘fluxibility’*) is maximum. This also explains 
why even hotter fluids remain at depth and do not directly feed the on- 
axis plumes (see Figs 1d and 2c). Extending the fluxibility concept to 
predict the most efficient recharge flow* yields temperatures between 
100°C and 200°C, which is where we also identify the strongest on- 
axis recharge flow (Fig. 2b). The energy required to heat the recharge 
flow is provided by the rising plumes, explaining why recharge is stron- 
gest adjacent to hot plumes (see Fig. 2c). The mass flux distribution for 
the off-axis flow component (blue bars in Fig. 2b) differs significantly, 
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Figure 2 | Mass flux analysis. a, Vertical mass flux of discharging and 
recharging fluid flow, evaluated 300 m below the sea floor, as a function of 
distance to the ridge axis. All fluid discharge, but only ~60% of recharge, occurs 
within a distance of 1.1 km of the ridge axis. The remaining ~40% recharge 
comes from fluids infiltrating the crust at greater distances. b, Vertical mass flux 
as a function of fluid temperature for on-axis (red) and off-axis (blue) flow 
components. Regions within the highly permeable pillow basalt layer and near 


with upward mass flux extending to much higher temperatures and 
most of the recharge flow occurring at colder temperatures. This implies 
that factors in addition to the fluid’s fluxibility also control the deeper 
flow. Almost half of off-axis recharge flow occurs below 50 °C simply 
because no off-axis thermal plumes are present to heat the recharge. 
The extensive area of pervasive off-axis recharge compensates for these 
thermodynamically inefficient cold conditions. Upflow within the lower 
crust occurs over a broader range of higher temperatures and decreases 
towards 700 °C because permeability is reduced at the brittle—ductile 
transition (Fig. 2b, Extended Data Fig. 1). Hence, the off-axis upflow along 
the flanks of the hot slot is predominantly shaped by the crustal perme- 
ability structure, whereas the on-axis circulation forms thermal plumes 
more strongly controlled by the thermodynamic properties of water. 

Pressure-temperature paths of numerical fluid tracers (Fig. 3) fur- 
ther highlight the characteristics of on-axis (red paths) and off-axis 
(blue paths) fluid circulation. With fluxibility limiting the temperatures 
in rising plumes above the melt lens to values below about 450 °C, the 
hot (up to 700 °C) deep-rooted fluids migrating towards the top of the 
melt lens cannot rise further until they cool below about 450 °C by mix- 
ing with the on-axis component. This may explain why the maximum 
pressures of last fluid-rock equilibration correlate very well with the 
depth of the axial melt lens’®, even though a significant mass fraction of 
the venting fluids may actually have migrated through much deeper 
and hotter parts of the oceanic crust before venting. The pressure- 
temperature paths of on- and off-axis components also resolve an appar- 
ent discrepancy between in situ measurements of vent temperatures at 
the EPR and temperatures of hydrothermal circulation estimated from 
fluid inclusions in gabbros from the Oman ophiolite*’ and the extinct 
Mathematician ridge” (Fig. 3). Despite the clear upper limit in observed 
axial vent temperatures”, fluid inclusion data show much higher entrap- 
ment temperatures that are strikingly consistent with the temperatures 
our 3D model predicts for most deep off-axis upflow. 

Our simulations have a total hydrothermal energy flux of about 
9 X 10’ W per kilometre ridge axis, which is in excellent agreement with 
estimates for the total magmatic energy input” at a ridge with a full 
spreading rate of 11 cm yr” '. The model allows us to quantify the energy 
fluxes from upper and lower crust into the ocean. Despite contributing 
only 40% to the total fluid mass flux, the 500 °C-700 °C upflow in the 
lower crust transports about 70% of the total energy released at the 
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the top of the melt lens have been excluded for clarification (see Extended Data 
Fig. 3). Percentage values are with respect to total upflow and downflow within 
the entire domain, respectively. c, Vertical fluid mass flux shows on-axis 
thermal plumes at temperatures below ~450 °C that are surrounded by strong 
recharge flow (dark blue) at temperatures above 100 °C—both effects are 
consequences of pressure- and temperature-dependent fluid properties*™. 


hydrothermal vent sites because it carries more thermal energy per 
kilogram of fluid than the <450 °C hot thermal plumes above the melt 
lens. This implies that not only is heat efficiently mined from the lower 
crust, but also that deep off-axis circulation may be the dominant 
process that cools the oceanic crust. Furthermore, the relative contri- 
butions of on- and off-axis hydrothermal flow to the total energy flux 
from crust to ocean are linked to the depth of the axial melt lens, where 
the transition in flow structure from hot (mainly permeability-controlled) 
flow in the lower gabbros to cooler (mainly fluid-property-controlled) 
flow in plumes occurs. 

This new hybrid hydrothermal flow structure reconciles previously 
incompatible observations and models that suggested either strong 
on-axis*° or deeper ridge-perpendicular hydrothermal circulation®””’. 
We find that both modes exist and naturally merge into a single hybrid 
flow structure in our whole-crust numerical experiments. The on-axis 
flow component is more vigorous and contributes slightly more to the 
total mass flux. It controls the depth of the melt lens and explains why 
high-temperature vent systems associated with fast-spreading ridges 
have so far been observed only directly on-axis. The deeper off-axis flow 
component provides about 70% of the hydrothermal energy flux and 
is what makes the overall thermal structure of the young oceanic crust 
consistent with seismic tomography data’? and the pervasive crustal- 
scale hydrothermal convection inferred from in situ analysis of the 
Oman ophiolite”’*. This implies that the entire oceanic crust experi- 
ences extensive high-temperature hydrothermal circulation, providing 
a mechanism to scavenge elements from the lower crust. Recent mass 
balance calculations reveal that the amounts of metal potentially mobi- 
lized by hydrothermal flow at ridges’* are far greater than estimates 
based on the size of known ridge-linked massive sulphide deposits”. 
Our simulations predict a cooling of about 200 °C of deeper off-axis 
fluids upon mixing with the on-axis component above the melt lens, 
which would lead to strong precipitation of dissolved metals near the 
roots of the sheeted dykes. Entrainment of cold sea water is likely to 
result in further re-deposition of metal within the pillow basalt layer. 
Finally, chemical differences between individual vents of the same vent 
field as observed at the neighbouring P and Bio9 vents”’ at EPR 9° 50’ N 
could result from mixing processes between fluids that obtained dis- 
tinct chemical signatures while cycling through the conjoined limbs of 
a single hybrid hydrothermal system. 
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Figure 3 | Comparison between observed and modelled hydrothermal fluid 
temperatures. In situ measurements of exit temperatures at EPR vents (see 
Methods) show a preferred range of 300 °C-400 °C. Temperature estimates 
based on fluid inclusions in gabbros from the Oman ophiolite” and the 
extinct Mathematician ridge*® (MR) are significantly higher. The pressure- 
temperature paths of on-axis (red) and off-axis (blue) fluid tracers in our 3D 
model explain the discrepancy. Highlighted paths 1 and 2 show warm recharge 
flow before ascending in plumes at temperatures below ~450 °C. In contrast, 
off-axis recharge flow (paths 3 and 4) is cold until reaching the steep thermal 
gradients at depth. Most of the subsequent upward flow occurs at 500 °C- 
700 “C—the temperature range suggested by fluid inclusion data. Fluid 
properties at these high temperatures are thermodynamically inefficient for 
plume formation” and off-axis fluids must cool by mixing with the on-axis 
component before they are able to feed a plume. Path 5 shows entrainment of 
cold sea water into a thermal plume within the pillow basalt layer. Dashed lines 
mark phase regions of sea water for orientation (L, liquid; V, vapour; H, halite) 
and CP marks the critical point of sea water. 


METHODS SUMMARY 


We constructed a 2D numerical model’* representing a vertical cross-section per- 
pendicular to a fast-spreading ridge to solve for the long-term thermal evolution 
of oceanic crust and mantle. Hydrothermal convection of pure water is coupled 
to crustal accretion by the temperature field. Fluid flow is treated as Darcy flow 
through a porous medium with parameterized permeability, and we solve for 
temperature, pore pressure and fluid velocity using an implicit finite-element method. 
Deformation of the crust is formulated as viscous creep of an incompressible 
medium and a dilation term is used to simulate crustal accretion in the melt lens 
and the dyking region. 2D simulations ran until a quasi-steady state was reached, 
typically after 200,000-500,000 years. The purpose of this model is to define a 
permeability field that leads to a thermal structure matching the one inferred from 
seismic data’®. Furthermore, the melt lens (corresponding to the 1,200 °C isotherm) 
should be located at the observed depth of 1,400 m. The obtained permeability 
field forms the basis for the 3D model, which covers the crustal volume above the 
1,000 °C isotherm in the 2D results and extends 6 km along the ridge and 8 km to 
each side of the ridge. In- and outflow are allowed through the top boundary only. 
The numerical formulation of hydrothermal flow is very similar to that used in the 
2D model, but solving the much larger system of equations required parallelization 
of the 3D model and iterative solution algorithms*’. Each 3D calculation starts at 
cold crustal conditions, with the exception ofa hot boundary layer along the bottom 
boundary that forms after 300 years of purely conductive heating before model 
initiation. Each 3D calculation simulated 2,000-8,000 years, after which vent field 
locations stabilized and discharging mass fluxes became nearly constant. 
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Plate tectonics, damage and inheritance 


David Bercovici’ & Yanick Ricard 


The initiation of plate tectonics on Earth is a critical event in our 
planet’s history. The time lag between the first proto-subduction 
(about 4 billion years ago) and global tectonics (approximately 3 bil- 
lion years ago) suggests that plates and plate boundaries became 
widespread over a period of 1 billion years. The reason for this time 
lag is unknown but fundamental to understanding the origin of 
plate tectonics. Here we suggest that when sufficient lithospheric 
damage (which promotes shear localization and long-lived weak zones) 
combines with transient mantle flow and migrating proto-subduction, 
it leads to the accumulation of weak plate boundaries and eventually 
to fully formed tectonic plates driven by subduction alone. We simu- 
late this process using a grain evolution and damage mechanism with 
a composite rheology (which is compatible with field and laboratory 
observations of polycrystalline rocks’), coupled to an idealized model 
of pressure-driven lithospheric flow in which a low-pressure zone is 
equivalent to the suction of convective downwellings. In the sim- 
plest case, for Earth-like conditions, a few successive rotations of the 
driving pressure field yield relic damaged weak zones that are inher- 
ited by the lithospheric flow to form a nearly perfect plate, with passive 
spreading and strike-slip margins that persist and localize further, 
even though flow is driven only by subduction. But for hotter sur- 
face conditions, such as those on Venus, accumulation and inher- 
itance of damage is negligible; hence only subduction zones survive 
and plate tectonics does not spread, which corresponds to observa- 
tions. After plates have developed, continued changes in driving forces, 
combined with inherited damage and weak zones, promote increased 
tectonic complexity, such as oblique subduction, strike-slip bound- 
aries that are subparallel to plate motion, and spalling of minor plates. 

The emergence of plate tectonics is arguably Earth’s defining moment. 
How our planet, alone amongst known terrestrial bodies, evolved the 
unique plate-tectonic form of mantle convection remains enigmatic’. 
Although geochemical and petrological evidence for tectonic activity in 
Earth’s deepest past is difficult to interpret*, analysis of zircons implies 
the existence of continental crust and formation of granites more than 
4 billion years (4 Gyr) ago”, possibly related to subduction zone melting. 
Similar evidence suggests that subduction-related crustal production 
existed 3.6 Gyr ago or earlier”®. However, geochemical and petrologic 
observations imply that tectonics was not widespread until 3.0-2.7 Gyr 
ago”'°. Whether plate tectonics spread globally over the intervening 
1-Gyr interval is of fundamental importance in understanding the emer- 
gence of plate tectonics. Moreover, whether the continued accumulation 
of plate boundaries increased tectonic complexity through geological 
time remains an open question. Here we propose a plate generation model 
to argue that transient proto-subduction and accumulated lithospheric 
damage allowed for the unique emergence and evolution of plate tec- 
tonics on Earth. 

Plate generation theory generally treats the formation of weak plate 
boundaries and strong plates by invoking self-weakening feedback 
mechanisms’. Models prescribing steady-state weakening rheologies (for 
example, viscoplasticity)'*"* often produce instantaneous plate-like flow, 
but are not necessarily consistent with rock-deformation experiments, 
and do not contain the physics to allow dormancy and accumulation of 
weak zones that become re-activated to form new plate boundaries'*”’. 


Damage models of plate generation have, however, been developed to 
treat self-weakening as well as the evolution of weakness, while also 
being compatible with laboratory and field observations’. 

Damage mechanics is a continuum theory for how deformation affects 
material strength, usually by the formation of micro-cracks or reduction 
of grain size, either of which weakens the material'®. The pervasiveness 
of mantle-lithosphere peridotitic mylonites—where localized deforma- 
tion correlates with reduced mineral grain sizes—at tectonic margins'”"* 
indicates that a self-weakening feedback between grain reduction by dam- 
age and grain-size-dependent strength (for example, diffusion creep)'”” 
at mid-lithospheric depths is key to plate-boundary formation. This 
grain-damage feedback mechanism is most plausible and evident in 
multi-phase or polycrystalline lithospheric rocks (for example, peri- 
dotite, which is a mixture of mostly olivine and pyroxene): grain growth 
in such rocks—which strengthens and heals the material—is known 
from laboratory and field observations to be impeded by pinning (that 
is, blocking of grain-boundary migration) by the interface between 
phases’’”’”?. Thus, our lithospheric damage mechanism is a coupled 
evolution model that describes the competition between damage and 
healing for both grains and interfaces in two-phase assemblages, and 
where interface pinning both limits grain growth and healing, and pro- 
motes damage and shear-localization’” (see Methods for further details). 

Our model medium is composed of two components or phases with 
comparable properties, combined into a 60-40% mixture (as with olivine- 
pyroxene ratios in peridotite), wherein the grain sizes and the rough- 
ness of the interface between phases change according to our evolution 
model'”. The two-phase grained continuum is incorporated into a hori- 
zontal viscous thin-sheet model of convective-type flow in the litho- 
sphere, assuming its strength is dominated by highly viscous flow and 
mylonitic behaviour at a mid-lithosphere depth (this assumption, how- 
ever, underestimates weakening and localization at divergent zones due 
to lithospheric thinning and melting). Flow velocity v is compelled by 
imposed sublithospheric pressure anomalies P; high or low pressures 
are akin to mantle convective currents impinging on or separating from 
this layer. The material rheology is a composite of dislocation creep (that 
is, viscosity goes inversely as stress squared), which dominates deforma- 
tion for large grains, and diffusion creep (that is, viscosity goes as grain- 
size cubed), which dominates for small grains. As deformation and 
damage drive grain reduction’, diffusion creep eventually dominates, 
causing the mixture viscosity [i to reduce where deformation is large, 
which thus focuses deformation and leads to a self-weakening feed- 
back. If fi were constant, the pressure would drive only divergence or 
convergence (over positive or negative P, respectively), or sources and 
sinks of mass in the thin layer, akin to ridges and subduction zones; the 
divergence rate is denoted by S=V-v where V is the horizontal two- 
dimensional gradient operator (see Methods). But with the nonlinear 
rheology of our medium, horizontal shearing and strike-slip motion 
can also arise; the vorticity 2 = z-V x v (where Z is the vertical unit vector) 
represents shearing because it measures velocity gradients perpendic- 
ular to the flow direction. Moreover, the driving pressure P can also 
interact with the nonlinear rheology to excite passive divergence that 
is not directly driven by any pressure anomaly. Focused divergence 
S and vorticity 2 are important metrics of plate-like flow because 
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they represent convergent, divergent and strike-slip motion on plate 
boundaries’. See Methods for further details. 

The model input requires several dimensionless ratios or numbers, 
which are derived from known physical quantities and laboratory experi- 
ments (see Methods). The key parameters controlling the efficiency of 
plate generation are the damage and healing numbers for both grains 
and interface; these numbers represent the rates of comminution and 
coarsening (for either grains or interface), respectively, relative to the 
rate of deformation. There are three damage numbers, that is, for the 
interface between phases and the grains of each phase, but they are 
typically equal to each other, and thus we denote them by one number 
D; for Earth’s conditions, D~100. There are likewise three healing 
numbers, C; for grain growth (where i = 1 or 2 depending on the phase) 
and C, for interface coarsening; unimpeded grain growth is typically 
much faster than interface healing’ and for Earth’s conditions we infer 
that both phases have C;~1, and the interface has C7~10~5. Several 
tests and comparisons of basic rheological mechanisms are discussed 
in Methods and shown in Extended Data. 

We first consider an idealized model of an early Archaean, single 
proto-subduction zone or downwelling, by imposing a pressure field P 
with one focused low-pressure region evolving with time (Fig. 1 top row), 
which is representative of time-dependent intermittent downwelling 
inferred to occur in the Archaean””’. In particular, the first downwelling 
is akin to a lithospheric drip, which typically cannot readily draw in 
lithospheric mass, and is therefore ephemeral. A new similar lithospheric 
downwelling is prescribed to nucleate perpendicularly offa pre-weakened 
end of the first downwelling, because other orientations would prob- 
ably be swept away by the initial flow field. The new downwelling is also 
relatively short-lived, and a third and then fourth downwellings simi- 
larly nucleate in turn. 

Even with a damage rheology, the first low-pressure region leads only 
to a convergence zone or sink of lithospheric mass (Fig. 1 left column). 
During the next downwelling stage, the new flow is influenced by the 
damaged weak zone left by the original downwelling, whose signature 
survives because of slow healing; this weak zone becomes a locus of 
vorticity and strike-slip motion (Fig. 1, second column). In the third 
stage, the weakness left from the second downwelling is adopted as a 
strike-slip margin, while that from the original downwelling becomes a 
focused spreading centre (Fig. 1, third column). As the process is repeated 
one last time, the accumulated weak zones comprise narrow divergent 
and strike-slip boundaries, which complement the driving subduction 
zone to make one contiguous weak plate margin bounding a rapidly 
moving tectonic plate (Fig. 1, forth column). Moreover, with each sub- 
duction zone jump, the divergence field S rebuilds to approximately the 
same magnitude, but the vorticity Q progressively grows and finally 
has a magnitude comparable to that of S (Fig. 1, bottom row). Although 
the calculation ceases after the last shift in the downwelling, we specu- 
late that the final plate-like flow would organize convection and limit 
further evolution of cold downwellings and drips by sweeping them 
into the existing subduction zone. 

Although this case is highly idealized, it shows that a fully developed 
plate can evolve from a downwelling only—where none of the bound- 
aries other than the subduction zone are actively driven—given tran- 
sience and re-nucleation of the downwelling, and a damage mechanism 
that provides localization and longevity of weak zones. With these basic 
ingredients, single proto-subduction zones could build and propagate 
plate tectonics by accumulating weak zones to be inherited as passive 
plate boundaries. The time to localize weak zones is several tens of Myr, 
thus if the time to form new drip downwellings is, with damage”, a few 
hundred Myr, then there is sufficient time to develop a full plate in the 
1 Gyr between initial proto-subduction and global plate tectonics. 

A similar exercise using hotter Venusian surface conditions permits 
a comparison to Earth’s putative twin*’”. Using the same material prop- 
erties but a lithosphere temperature elevated by 200-400 K above the 
Earth-like case, the damage number D is reduced by a factor of about 
10, and the healing numbers C; and C, are increased by a factor of up to 
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Figure 1 | Lithospheric flow model with damage driven by intermittent 
proto-subduction. The low in the two-dimensional horizontal pressure field P 
represents a convective downwelling, rotations of which (top row, with time t 
indicated) model nucleation of new proto-subduction. Weak regions inherited 
from previous downwellings localize into passive plate margins, indicated by 
divergence S, strike-slip vorticity Q, horizontal velocity v and viscosity fi (tows 
2-4). The final state is a four-sided plate, driven by subduction only. Bottom 
row shows evolution of maximum divergence and vorticity. Scales of distance, 
time and pressure are ~5,000 km, ~50 Myr and ~300 MPa. Extrema 
(multiplied by 100, except for log 1) are indicated on each frame, save P, which 
is always 0-1. See Extended Data Fig. 1 for contour details. 


10 (see Methods). In this case, as a downwelling migrates to various 
positions (see Fig. 2), only very faint weak zones accumulate because 
damage itself is weaker while healing is stronger, thereby resulting in 
zones of passive divergence and strike-slip vorticity about an order of 
magnitude weaker than the convergence rate. The final flow field is domi- 
nated by convergence, giving the appearance of a subduction-only sur- 
face. This result provides a simple explanation for why Venus possibly 
has subduction zones”*, but no extant plate tectonics. 

After the establishment of global plate tectonics and continental growth 
on Earth, approximately 3 Gyr ago, plate reorganizations became more 
complicated. Nevertheless, we explore an idealized model of present-day 
plate motion changes and the resulting accumulation of complexity by 
inherited weak zones. The experiment is an abstraction of the change in 
motion of the Pacific plate associated with the 47-Myr-ago Emperor- 
Hawaiian bend*”. The initial condition has one low-pressure or 
‘subduction’ zone driving a plate-like flow field, similar to the final 
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Figure 2 | Lithospheric flow model for Venus. Quantities shown are the same 
as in Fig. 1 but with parameters appropriate for Venus’ higher surface 
temperature, leading to damage parameter D = 10 (instead of 100 for the Earth 
case) and healing numbers C; = 10 and C;=10~* (instead of C;=1 and 
C;=10~° for the Earth case). Length scale is the same as in Fig. 1, the time and 
pressure scales are reduced by a factor of about 10 (assuming a 200 K increase in 
mid-lithosphere temperature from the Earth model’*”’). 


stage shown in Fig. 1. This pressure field initially drives motion north 
with azimuth 15° west of north (W15N), which is approximately the 
trajectory of the Emperor seamount chain (Fig. 3 first column). The 
pressure gradient is then rotated to W60N, roughly the azimuth of the 
Hawaiian chain (Fig. 3 second column). After the rotation in P, the 
viscosity ji and thus both the divergence S and vorticity Q retain 
‘memory of the prior plate configuration. Although highly idealized, 
the experiment yields several features suggestive of present-day Pacific 
plate motion (Fig. 3 third column). First, the passive divergent zones 
inherit the older plate geometry and form two ridges wrapping around 
the southeast corner of the plate, similar to the East Pacific Rise and the 
Pacific- Antarctic Ridge. The new vorticity field inherits all four of the 
pre-rotation weak zones, and continues to grow in magnitude; the 
resulting strike-slip motion is non-orthogonal to the new subduction 
zone, leading to oblique convergence along the northern boundary, 
similar to the Aleutian trench, strike-slip motion on the eastern bound- 
ary similar to the San Andreas system, and oblique spreading as on the 
southern Pacific plate. The northern junction between the old and new 
subduction zones is suggestive of the sharp Aleutian-Kurile arc corner, 
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Figure 3 | Lithospheric flow model with Pacific-like rotation. Quantities 
shown are the same as in Fig. 1 with the same Earth-like parameters. The initial 
configuration has established plate boundaries, as in the final step of Fig. 1, but 
in a WI5N direction analogous to the Emperor seamount chain trajectory (left 
column). The pressure field then rotates 45° to W60N analogous to the present- 
day Hawaiian island chain direction (middle column). Divergence, vorticity 
and velocity of the Earth’s present-day Pacific plate (computed up to spherical 
harmonic degree 64, using the Nuvell plate model”’) are shown for comparison 
(right column). 


and the truncated convergent corner of the prior plate is reminiscent of 
the Philippine plate (see Fig. 3 second column, viscosity field at bot- 
tom). In the end, the inheritance of damaged zones by lithospheric flow 
after a simple change in the slab-pull force leads to several basic fea- 
tures suggestive of Earth’s actual Pacific plate. 

In summary, we posit that the emergence and evolution of plate tec- 
tonics arise by the interaction ofa realistic polycrystalline grain-damage 
mechanism’ and time-dependent subduction, which together allow 
plate boundary localization and longevity. 


METHODS SUMMARY 

The theoretical model involves flow in a thin layer of grained, viscous two-phase 
fluid, wherein the phases represent olivine and pyroxene in a peridotite lithosphere. 
The layer’s rheology is a composite between dislocation and diffusion creep, depending 
on whether grains are, respectively, bigger or smaller than a critical size. Grain 
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evolution is coupled to the evolution of the interface between the two phases, which 
imposes Zener pinning surfaces that block normal grain growth depending on the 
sharpness of these surfaces. The interface sharpness evolves by the competition 
between surface-tension-driven coarsening and damage-driven sharpening (that 
is, distortion, mixing and rending), and sharpened pinning surfaces further block 
grain growth and facilitate grain-size reduction. When grains shrink enough, they 
undergo diffusion creep, which depends strongly on grain size, thus leading to a 
self-softening feedback; that is, deformation reduces grain size and viscosity, which 
enhances deformation and damage, which reduces grain size and so on. Pinning 
also prevents healing of small-grained weak zones, depending on temperature; 
thus weak zones can be inherited and reactivated even after falling dormant. Flow 
in the thin layer is driven by a low-pressure zone imposed at its base, which repre- 
sents the pull of a convective downwelling. In a pristine lithosphere, only convergent 
flow is driven into the low-pressure ‘downwelling’ and generates a damaged weak 
zone there. The pressure field is then migrated, and the previously formed, slowly- 
healing damaged weak zones are inherited by the lithospheric flow and reactivated 
to form focused divergent or strike-slip zones, or combinations thereof, depending 
on their alignment. Thus, focused plate boundaries of all types arise by coupling 
time-dependent downwelling with polycrystalline grain damage. Moreover, as the 
healing rate of weak zones depends on surface temperature, colder surfaces, as on 
Earth, promote more intense weak zone formation and inheritance than do much 
hotter surfaces, as on Venus. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Human alterations to nutrient cycles’” and herbivore communities’ 
are affecting global biodiversity dramatically’. Ecological theory pre- 
dicts these changes should be strongly counteractive: nutrient addi- 
tion drives plant species loss through intensified competition for 
light, whereas herbivores prevent competitive exclusion by increas- 
ing ground-level light, particularly in productive systems*”’. Here we 
use experimental data spanning a globally relevant range of condi- 
tions to test the hypothesis that herbaceous plant species losses caused 
by eutrophication may be offset by increased light availability due to 
herbivory. This experiment, replicated in 40 grasslands on 6 conti- 
nents, demonstrates that nutrients and herbivores can serve as counter- 
acting forces to control local plant diversity through light limitation, 
independent of site productivity, soil nitrogen, herbivore type and 
climate. Nutrient addition consistently reduced local diversity through 
light limitation, and herbivory rescued diversity at sites where it alle- 
viated light limitation. Thus, species loss from anthropogenic eutro- 
phication can be ameliorated in grasslands where herbivory increases 
ground-level light. 

The diversity of life on Earth underlies critical ecosystem functions 
and economically important services’’, and the current rapid rate of 
biodiversity loss” lends urgency to the task of understanding the forces 
that maintain biodiversity. Resources required for economic growth, 
energy and agriculture have all impacted natural ecosystems on a global 
scale. Introductions and extirpations of herbivore species, particularly 
as land is converted for grazing*’, and increased nutrient supply are 
symptoms of humanity’s global footprint’*. Such widespread altera- 
tion of herbivores and nutrient supply may jointly determine the future 
diversity of ecosystems. For example, in highly productive, eutrophic 


systems in which plant species extinction is likely to occur owing to a 
loss of ecological niches’, ecological theory predicts that herbivores can 
act to maintain local-scale plant diversity if they selectively consume the 
superior resource competitors’. Empirical studies in many ecosystem 
types find highly variable effects of herbivores on plant species diversity”, 
with the magnitude ofherbivore mediation of diversity frequently observed 
to be greatest in regions of high ecosystem productivity’’"’”. Thus, eco- 
system productivity and its regional climate drivers have been observed 
to mediate the local-scale effects of herbivores on plant diversity. The 
availability of ground-level light is the commonly postulated mech- 
anism modulating the relationships among plant diversity, herbivory 
and observed gradients of plant productivity. However, these patterns 
and predictions have primarily emerged from studies across observed 
gradients of productivity or reviews and meta-analyses based on an 
extremely limited number of single-site experiments that manipulate 
both nutrients and herbivory, often with different methods'’’*"”. In 
most of these studies, ground-level light has not been measured. Thus, 
the generality of these effects is only suggestive, and the mechanisms 
underlying the observed relationships remain elusive. 

Local-scale plant diversity is likely to be maintained through an inter- 
dependent system of interactions with multiple plant species that are 
consumed by the same herbivores and that compete for light and nutri- 
ents. In particular, terrestrial plants compete for nutrients and light at 
the scale of interactions among individuals (neighbourhood of ~1 m? 
in grasslands), and one important mechanism for maintaining local 
coexistence is a trade-off in competitive ability for nutrients (below- 
ground) and for carbon via light (aboveground)'* ”. Nutrient enrichment 
can lead to competitive exclusion of inferior competitors for light’?”®, 
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Figure 1 | Geographic and climatic distribution of experimental sites. 

a, Locations of the 40 ‘Nutrient Network’ sites at which the full factorial 
experiment was replicated. At each site, nutrient supply and plot access by large 
herbivores were manipulated. Sites contributed 1-4 years of plot-scale 


but herbivores can remove plant biomass, potentially alleviating unders- 
tory light limitation. However, herbivory creates another axis of potential 
trade-offs among plant species, involving investment in rapid growth 
and light capture versus investment in defence against herbivory” **. 
These interactions result in a dynamic local community in which 
composition responds quickly to changes in the strength of nutrient 
limitation or herbivory**”*. In eutrophied systems, where nutrient lim- 
itation is alleviated and productivity is increased, theory predicts that 
these tradeoffs among plant strategies will simplify the plant community 
to species sharing an herbivore and competing for a single resource— 
light”. 

We used an experiment replicated at 40 sites on 6 continents (Fig. 1) 
to test the hypothesis that herbivores mediate species losses caused by 
nutrient addition by increasing ground-level light, particularly in eutrophic 
and highly productive systems. We manipulated herbivores and nutri- 
ents using a factorial experiment (nutrient addition X exclusion of her- 
bivores > about 50 g, see Methods and ref. 27 for details) replicated in 
40 sites dominated by herbaceous plants, spanning broad environmental 
gradients of productivity (114 to 1,976 gm 7 yr _*), precipitation (mean 
annual precipitation from 224 to 1,898 mmyr '), temperature (mean 
annual temperature from 0 to 22.1 °C), and soil nitrogen (mean per- 
centage of soil N from 0.018 to 1.182%) (Fig. 1 and Extended Data 
Table 1). In each plot, we measured local-scale responses of produc- 
tivity, light and the number of plant species (diversity) using standard 
methods”. We also examined site-level covariates including precipita- 
tion, temperature, herbivory intensity, soil nitrogen and atmospheric- 
nitrogen deposition rates. Although most sites provided 3 years of data, 
a subset of sites contributed 4 years of post-treatment data, and a few 
sites, established later, provided only 1 or 2 years of data (Extended Data 
Table 1). Effects of the experimental treatments were broadly consistent 
across all years of treatments (Extended Data Fig. 1); we present results 
from the three year duration in the main text, for a balance of spatial 
and temporal extent (see Extended Data Tables 2-8 for statistical mod- 
els that describe responses in data collected 3 years after initiation of 
the experimental treatments). 

Our results support an important mechanism by which nutrients 
lead to diversity loss. In particular, nutrient addition caused declines in 
diversity (Fig. 2a and Extended Data Table 2, P< 0.001), increased total 
plant biomass (Fig. 2b and Extended Data Table 3, P< 0.001), and 
increased light limitation (reduced transmission of photosynthetically 
active radiation (PAR) to ground level, Fig. 2c and Extended Data Table 4, 
P<0.001) both inside and outside of fences. Ground-level light avail- 
ability, a function of light interception by live and dead biomass, declined 
with increasing total biomass (Fig. 2d and Extended Data Table 5, 
P<0.001). This result is consistent with eutrophication-induced loss 
of niches for coexistence*”'?”* and demonstrates the generality of eutro- 
phication as a primary force controlling the diversity of grassland com- 
munities by reducing ground-level light”. 
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photosynthetically active radiation (PAR), plant species richness and total plant 
biomass data. b, Study sites represent a wide range of mean annual temperature 
and precipitation (n = 40). Additional site details are provided in Extended 
Data Table 1. 


Although the removal of vertebrate herbivores did not have consistent 
effects on diversity (Fig. 2a, P = 0.522) or biomass (Fig. 2b, P = 0.803), 
herbivore removal increased light limitation (Fig. 2c, P = 0.013). The 
lack ofa consistent effect of herbivore removal on diversity across these 
globally distributed grassland sites (Fig. 2a) reflects the broad range of 
positive and negative effects found in past studies''’””*. However, a 
critical assumption underlying the hypothesis that grassland diversity 
is jointly controlled by nutrient supply and consumers is that diversity 
should be rescued consistently by herbivory. In both ambient and 
eutrophied plots, herbivory should lead to greater diversity because 
herbivores can alleviate ground-level light limitation, thereby increas- 
ing the number of possible tradeoffs (nutrients, light) that maintain 


plant species diversity”"’. 
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Figure 2 | Mixed-effects model parameters showing average response of 
plots (n = 360) to 3 years of nutrient addition and herbivore exclusion by 
fencing. a—d, Nutrient addition (Nut) and herbivore exclusion by fencing 
(Fnc) represent the difference from control plots; Nut*Fnc is the additional 
effect of combining nutrients and fences (the interaction). Error bars represent 
the 95% confidence interval. a Plot-scale diversity declines with nutrients, but is 
not consistently altered with fencing. b, Total biomass increases with nutrients, 
but is not consistently affected by fencing. c, Exclusion of herbivores and 
addition of nutrients independently reduce ground-level light. d, The 
proportion of light reaching the ground declines with increasing aboveground 
biomass. 
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We tested whether the inconsistent herbivore effects on plant diver- 
sity reflected variable herbivore effects on light and found that plant 
diversity increased quantitatively with herbivore effects on ground- 
level light (Fig. 3, P = 0.003); nutrient addition did not modify this 
relationship (Extended Data Fig. 2). Sites with the greatest effects of 
herbivores on light and diversity spanned four continents and were 
dominated by larger vertebrates including wild and domestic ungu- 
lates, macropods, and lagomorphs (Fig. 3 and Extended Data Table 1). 
Thus, we show that under conditions of experimentally imposed nutri- 
ent supply gradients and among sites spanning greater than a 26-fold 
observed productivity gradient, herbivores rescue plant diversity to the 
extent that they enhance ground-level light, regardless of herbivore 
identity or nutrient supply. 

Herbivore effects on plant diversity were not related to variation in 
soil nitrogen, nitrogen deposition rates, or site productivity. The change 
in ground-level light caused by removing herbivores was greatest at 
sites with high herbivory intensity (estimated as change in biomass in 
response to fencing; P = 0.006, AIC (Akaike information criterion)- 
weighted importance = 0.98, Extended Data Table 6). Herbivory inten- 
sity, in turn, was greatest at sites with a cool dry-season climate (P = 0.01, 
importance = 1.0, Extended Data Table 7) and sites at which the annual 
temperature is relatively warm (P = 0.03, importance = 0.52) and con- 
stant (P = 0.05, importance = 0.63). However, the change in diversity 
due to herbivores was best described by their effect on ground-level 
light (P = 0.012, importance = 1.0, Extended Data Table 8); site-level 
climate, productivity, soil nitrogen, nitrogen deposition rates, and her- 
bivory intensity were not significant descriptors of changes in site-level 
plant diversity (P > 0.05 and importance < 0.25 for these factors). 
Thus, climate, which predicts herbivory intensity, places an ultimate 
constraint on the effects of herbivores on plant diversity, but local plant 
diversity is determined primarily through herbivore effects on ground- 
level light. These experimental data demonstrate that across a wide 
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Figure 3 | Effects of herbivore exclusion by fencing on mean grassland 
species richness and the mean proportion of PAR reaching ground level at 
29 sites after 3 years of treatment. Log response ratios (LRRs) compare light 
and richness inside and outside fences. The grey region indicates the 95% 
confidence interval around the regression (P = 0.003). Extended Data Table 1 
shows site number codes. Herbivore exclusion generally leads to reduced 
ground-level light (<0 on x axis) coupled with reduced grassland species 
richness (<0 on y axis). Herbivore effects are consistent across fertilized and 
unfertilized plots (Extended Data Fig. 2). 
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range of the world’s grasslands, herbivores serve as an important force 
to maintain plant diversity in grassland ecosystems in which they increase 
ground-level light availability, consistent with the prediction that light 
limitation is a critical factor controlling grassland species diversity®'’, 
but counter to the interpretation of nutrient supply or ecosystem pro- 
ductivity as the dominant force constraining herbivore effects on local 
plant diversity'""”. 

A predictive understanding of the forces controlling grassland diver- 
sity is critical for informing issues of environmental and agricultural 
sustainability on all continents because of the steady conversion of the 
world’s grasslands for use in livestock production*’. Previous work 
observed that herbivores have the greatest effects on diversity in high 
productivity ecosystems'’"'®, whereas the experimental results pre- 
sented here demonstrate that in grasslands in which herbivores increase 
ground-level light, they rescue plant diversity regardless of nutrient 
addition or environmental productivity. This result is consistent with 
ecological theory”’®”°, simultaneously providing greater mechanistic 
understanding” and clarifying the apparent overall lack of response of 
plant diversity to herbivory”’. The results from our global-scale experi- 
ment suggest that in cases in which anthropogenic nutrient inputs to 
natural systems are high, grassland plant diversity will decline. However, 
in grasslands in which herbivory leads to increased ground-level light 
availability, we expect that these diversity losses will be ameliorated. 


METHODS SUMMARY 


All 40 herbaceous-dominated (‘grassland’) sites in the analysis (Fig. 1) implemen- 
ted a full factorial combination of nutrient addition (‘control’ or ‘all nutrients’) and 
herbivore exclusion (‘control or ‘fenced’). The experimental design, treatments and 
sampling procedures to document plant diversity, biomass, light interception by the 
canopy, and soil chemistry were replicated at all sites, as detailed in ref. 27 and 
described in the Methods. Climate data were derived for all sites using the WorldClim 
database (version 1.4)°°. All sites contributed at least 1 year of post-treatment data. 
Light, biomass and species richness were measured concurrently at 29 sites con- 
tributing 3 or more years of data (Extended Data Table 1); we focus on these in our 
main analyses. 

We developed mixed-effects models with ‘site’ and ‘block nested within site’ as 
random effects using R (version 3.1; R Foundation for Statistical Computing). Analyses 
in Fig. 2a, b were performed using the nlme R library; we used the Ime4 R library 
when the proportion of light (binomial error structure) was the response variable 
(Fig. 2c, d). We used the glm R function to analyse changes in each factor estimated 
as log(S/+/Ss—), where S;; is species richness or proportion PAR reaching the 
ground in fenced plots and S;— represents the comparable control plot measure- 
ment (Fig. 3). Finally, we used the dredge function in the MuMIn R library to assess 
the relative importance of potentially covarying site-level factors. We fit all possible 
models, deriving parameter values with error, and kept those models not more 
than four AICc (AIC corrected for small sample size) units higher than the best 
model. For each parameter, we further estimated AIC-weighted ‘importance’, a 
unitless metric indicating summed Akaike information across all models in which 
the parameter appears. Importance ranges from 0 (parameter not given explanatory 
weight) to 1 (parameter in all top models). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Site selection. The Nutrient Network (NutNet) is a network of researchers working 
at herbaceous-dominated (‘grassland’) sites in countries spread across six conti- 
nents, who are performing coordinated, globally distributed observations and 
experiments. The full experimental design is detailed here and in ref. 27. All 
NutNet sites are located in areas dominated by herbaceous vegetation representing 
the regional species composition (for example, shortgrass steppe, tallgrass prairie), 
referred to as ‘grassland’ here. The NutNet experimental design analysed here is a 
completely randomized block design with four 5 X 5 m plots per block and three 
replicate blocks at most sites (with blocks ranging from 1 (n = 1 site) to 5 (n =3 
sites)). Within-site replication is used to determine relative strength of responses, 
but the main experimental replication comes from the number of sites. 
Experimental treatments. All 40 sites included in the current analysis (Fig. 1) 
implemented a full factorial combination of nutrient addition (‘control’ or ‘all 
nutrients’) and consumer density (‘control’ or ‘fenced’) for a total of four treat- 
ments in randomized, complete blocks. Standard nutrient addition and sampling 
protocols were carefully replicated among sites”. All sites collected data before 
application of treatments (year 0); most sites began sampling in 2007, but a subset 
began sampling in subsequent years. Nutrient and fencing treatments’” were 
implemented the following year (year 1) and have been maintained continuously 
since then. All sites contributed at least 1 year of post-treatment data; 39 of these 
sites contributed 3 or more years of post-treatment data. Light, biomass and species 
richness measurements (see below) were conducted concurrently at 29 sites con- 
tributing 3 or more years of data; we focus on these in our main analyses. 

Fences designed to exclude aboveground mammalian herbivores (> about 50 g) 
were erected around two plots in each block, one receiving a nutrient combination 
(described next) and one used as an ambient nutrient control plot. Fences were 
230cm tall with the lower 90cm surrounded by 1-cm woven wire mesh. An 
additional 30-cm outward-facing flange was stapled to the ground to exclude 
digging animals (for example, rabbits, voles), although not fully subterranean 
animals (for example, gophers, moles). Four strands of barbless wire were strung 
at equal vertical distances above the wire mesh. Exclosures were built at all sites 
before the second year of plant growth. Although most (33) sites built fences 
exactly to these specifications, a few sites (8) faced challenges (for example, snow- 
pack, materials availability, elephant activity) that required minor modifications. 
Modifications are described in Supplementary Table 1. 

Nitrogen, phosphorus and potassium were applied annually to experimental 
plots; micronutrients were applied once at the start of the experiment to avoid 
toxic levels from over-application. Nutrient addition rates and sources were: 10 g 
Nm “yr! as timed-release urea ((NH2)2CO), 10gPm “yr? as triple-super 
phosphate (Ca(H,PO,)2), 10 gKm “yr! as potassium sulphate (K,SO,) and 
100 gm” yr! of a micronutrient mix of Fe (15%), S (14%), Mg (1.5%), Mn 
(2.5%), Cu (1%), Zn (1%), B (0.2%) and Mo (0.05%). 

Each sampling area was separated by at least 1.5m from neighbouring plots 
(1m walkway and 0.5 m within-plot buffer), which served to minimize indirect 
effects of treatments in one plot on adjacent plots (for example, nutrient leaching, 
shading or mycelial networks). Note that the nutrient and fence treatments had 
strong measurable effects on plant responses (for example, biomass, richness) 
indicating that plots and measurements were sufficiently sized and spaced. 
Species diversity. All NutNet sites followed standard sampling protocols. A ran- 
domly designated 1X 1m subplot within each 5X 5m plot was permanently 
marked and sampled annually at peak biomass. In the 1 X 1 m permanently marked 
subplot, cover was estimated visually to the nearest 1% for every species over- 
hanging the subplot; cover estimates also included woody over-story, litter, bare 
soil and rock. 

Productivity. Adjacent to the permanent 1 X 1m cover subplot, standing crop 
was estimated destructively by clipping at ground level all aboveground biomass of 
individual plants rooted within two 1 X 0.1m strips (for a total of 0.2m”). All 
biomass was dried at 60 °C to constant mass before weighing to the nearest 0.01 g. 
Weights were multiplied by five to estimate grams per square meter. Pre-treatment 
data (year 0) from each site in this study demonstrate high correlation 0.976 (95% 
confidence interval: 0.955-0.987) between year 0 mean plant biomass in the con- 
trol plots (n = 3 for most sites) and year 0 mean plant biomass for the site as 
characterized by all plots (n = 30 for most sites). 

Light interception. At the time of biomass clipping, photosynthetically active 
radiation (PAR, fmol photons per m” per s) was determined at approximately 
solar noon (between 11:00 and 14:00). Two measurements, integrated across a 1-m 
light ceptometer, were made at ground level from opposite corners of each 1-m? 
plant-diversity plot, diagonal to each other, and one measurement was made above 
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the canopy of each plot. We calculated the proportion of PAR available at ground 
level as the ratio of the average of the ground level to the ambient measurements. 
Climate. We used the WorldClim database to derive comparable climate data for 
all sites (version 1.4; http://www.worldclim.org/bioclim). This database provides 
high-resolution interpolated global climate data for stations with 10 to 30 years of 
data*’. To examine climate covariates with site-level fencing effects on net con- 
sumption (biomass inside minus outside of fences), light and richness, we used 
climate variables that summarized the mean and seasonality of site-level temper- 
ature and precipitation. These were (BIO designator indicates the variable code in 
the WorldClim database): mean annual temperature (°C; BIO1), mean maximum 
temperature of the warmest month (°C; BIO5), mean minimum temperature of 
the warmest month (°C; BIO5), mean annual precipitation (mm per year; BIO12), 
precipitation variability (coefficient of variation in precipitation among months; 
BIO15), rainfall-potential evapotranspiration (mm per month), temperature vari- 
ability (standard deviation of temperature among months; BIO4), mean temper- 
ature in the wettest quarter (°C; BIO8), and mean temperature in the driest quarter 
(°C; BIO9). 
Nitrogen deposition. We used modelled nitrogen deposition rates”! to determine 
the annual atmospheric N deposition (kgNha | yr~') for each experimental site 
(associated with model output based on latitude and longitude). N deposition was 
modelled based on existing measurements and future projections using a global 
three-dimensional chemistry-transport model (TM3)*’. The spatial resolution of 
the model, 5 degrees longitude by 3.75 degrees latitude, and the resolution of the 
output grid (50 X 50 km sub-grids), provide sufficient resolution to distinguish 
site-level variation in annual N deposition among our experimental sites. 
Statistical analysis. To explore the independent and interactive effects of verte- 
brate herbivory and nutrient supply on species diversity, total biomass and photo- 
synthetically active radiation, we developed mixed effects models with site and 
block within site as random effects. Analyses in Fig. 2a, b were performed using the 
nlme library in R (R version 3.1; R Foundation for Statistical Computing); for 
Fig. 2c, d, we used the Ime4 R library to fit models in which proportion of light was 
the response variable (binomial error structure and a proportion bounded between 
0 and 1). Although not presented here, models using logit and arcsin square root 
transformations of the data generated qualitatively identical results. Site and block 
nested within sites were included in all regressions. We also estimated the effects of 
herbivores on richness and light at each site as the change in these factors resulting 
from fencing in both fertilized and unfertilized plots. Change in each factor was 
estimated as the log ratio of the treatment divided by the control, log(S;,/S/_), 
where S;+ is the species richness or proportion PAR reaching the ground in fenced 
plots and S;_ is the species richness or proportion PAR reaching the ground in 
control plots. We examined residuals to ensure homogeneity of variance. Because 
of missing PAR data for a few sites, this analysis included 29 sites. The relation- 
ships were independent of whether plots had been fertilized (see Extended Data 
Fig. 2 for more details), so we present a final model of site means including both 
fertilized and unfertilized plots in the main text. The log ratio analyses were 
performed using the glm library in R (R version 3.1; R Foundation for Statistical 
Computing). We generated models separately for each experimental year (year 1, 
n= AO; year 2, n = 38; year 3, n = 34; year 4, n = 30). The results were broadly 
consistent (Extended Data Fig. 1), so we present results from 3 years of manipula- 
tions in the main text and results comparing 1 to 4 years of manipulations, greater 
spatial (year 1 and year 2) or temporal (year 4) extent, in Extended Data Fig. 1. 
Finally, to examine the effects of climate and site productivity as predictors for 
site-level mean herbivore effects on biomass, ground-level light and plant richness, 
we analysed site-level mean values using model averaging’. The model averaging 
approach allowed us to assess the relative importance of a range of covarying factors, 
and to recognize explicitly that there could be a suite of similar models. Prior to 
fitting the models, all of the independent variables were standardized using the 
standardize function in the arm R library. ‘Importance’ in this modelling approach 
is a term representing the relativized sum of the Akaike weights summed across all 
of the models in which the parameter appears that are within four AICc (AIC cor- 
rected for small sample size) units of the model with the lowest AIC. Importance 
ranges from 0 (parameter not given explanatory weight) to 1 (parameter in all top 
models). We used the dredge function in the MuMIn R library to fit all possible 
models. We estimated parameter values, errors, and AIlC-weighted importance 
using the model.avg function in the MuMIn R library. 


31. The Oak Ridge National Laboratory Distributed Active Archive Center. http:// 
daac.ornl.gov/ (accessed 25 August 2011). 

32. Grueber, C. E., Nakagawa, S., Laws, R. J. & Jamieson, |. G. Multimodel inference in 
ecology and evolution: challenges and solutions. J. Evol. Biol. 24, 699-711 
(2011). 
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Extended Data Figure 1 | Regression parameters for treatment effects. 

a-c, All available data are shown for richness (a), total biomass (b) and ambient 
light reaching ground level (c). Error bars represent +2 s.e.m. Treatment 
years and their associated sample sizes are shown in each panel. One- and 
two-year models represent greater spatial extent and replication, but reduced 
temporal extent compared to Fig. 2 in the main text. Four-year models 
represent longer temporal effects, but reduced spatial extent, particularly for 
light measurements. All models were fitted as in Extended Data Tables 2-4 and 
described in the Methods. 
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Extended Data Figure 2 | Fertilization does not alter the relationship 

between ‘fence’ effects on light and diversity. The log response ratio (LRR) 
model of the effect of fences (herbivore exclusion) on richness and light (year 3 
data) demonstrates no additional effect of nutrient addition on the relationship 
shown in Fig. 3. The grey region indicates the 95% confidence interval around 
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the regression. The effect of fences on ground-level light predicts changes in 
plot-scale species richness (P = 0.00254), whereas fertilization is not included 
in the final statistical model of this relationship (P > 0.05). Thus, the magnitude 
of the effect of grazers on richness is dependent on the magnitude of their effect 
on light regardless of whether a plot has been fertilized. 
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Extended Data Table 1 | Sites contributing experimental data 


Control Control plot 
Exp’t MAT MAP Mean plot mass richness 
Site # Site code* Countr Continent Lat. Long. ears °c mm soil %N m? m 
1 azi.cn China Asia 33.670 101.870 4 0.0 684 0.556 411.40 33.17 
2 barta.us USA N Amer. 42.245 -99.652 4 8.7 597 0.057 442.90 13.75 
3 bldr.us* USA N Amer. 39.972 -105.234 4 9.7 425 0.089 283.45 8.10 
4 bnch.us* USA N Amer. 44.277 -121.968 4 5.5 1647 0.537 281.87 9.43 
5 bogong.au* Australia Australia -36.874 147.254 4 5.7 1592 0.722 554.27 18.93 
6 burrawan.au* Australia Australia -27.735 151.140 4 18.4 683 0.087 251.18 10.33 
7 cbgb.us* USA N Amer. 41.785 -93.385 3 9.0 855 0.063 870.38 8.08 
8 cder.us* USA N Amer. 45.401 -93.201 4 6.3 750 0.039 270.40 10.55 
9 cdpt.us USA N Amer. 41.200 -101.630 4 9.3 454 0.097 233.59 11.72 
10 cowi.ca* Canada N Amer. 48.460 -123.380 4 9.8 764 0.374 1018.63 4.80 
11 elliot.us* USA N Amer. 32.875 -117.052 4 17.0 224 0.165 463.98 11.25 
12 frue.ch Switz. Europe 47.113 8.542 3 9.0 1150 0.364 1046.54 13.58 
13 gilb.za* S Africa Africa -29.284 30.292 3 13.1 926 1.182 223.23 27.08 
14 hall.us* USA N Amer. 36.872 -86.702 4 13.6 1282 0.139 948.55 6.13 
15 hart.us* USA N Amer. 42.724 -119.498 4 7.4 272 0.109 113.82 10.33 
16 hero.uk UK Europe 51.411 -0.639 4 9.9 692 NA 740.88 16.00 
17 hnvr.us* USA N Amer. 43.419 -72.138 3 6.4 1033 0.395 524.58 9.50 
18 hopl.us* USA N Amer. 39.013 -123.060 4 12.3 1127 NA 347.28 18.43 
19 kiny.au* AUS Australia -36.200 143.750 4 15.5 426 0.120 490.11 11.20 
20 _koffler.ca Canada N Amer. 44.024 -79536 1 Be 815 NA 964.47 9.39 
21 konz.us* USA N Amer. 39.071 -96.583 4 12.0 872 NA 433.15 15.07 
22 lancaster.uk* UK Europe 53.986 -2.628 3 8.0 1322 1.180 126.97 9.50 
23 look.us* USA N Amer. 44.205 -122.128 4 4.8 1898 1.170 332.69 7.20 
24 marc.ar Argent. SAmer. __-37.715 57.425 1 13.9 838 NA 953.79 11.90 
25 mcla.us* USA N Amer. 38.864 -122.406 4 13.5 867 NA 554.20 5.33 
26 mtca.au* AUS Australia -31.782 117.611 4 17.3 330 0.092 115.40 14.66 
27 pape.de Germany Europe 53.086 7.473 4 8.9 783 0.190 1292.78 5.00 
28 rook.uk UK Europe 51.406 -0.644 4 9.8 706 NA 233.19 10.92 
29 sage.us* USA N Amer. 39.430 -120.240 4 5.7 882 0.714 159.92 12.36 
30 saline.us USA N Amer. 39.050 -99.100 4 11.8 605 NA 490.44 10.75 
31 sava.us* USA N Amer. 33.344 -81.651 4 17.3 1194 0.018 393.80 10.30 
32 sereng.tz* Tanzania Africa -2.255 34.513 3 22.1 854 O33 362.90 12.58 
33 sgs.us* USA N Amer. 40.817 -104.767 4 8.4 364 0.081 129.08 6.89 
34 sier.us* USA N Amer. 39.236 -121.284 4 15.6 935 0.171 378.29 8.32 
35 smith.us* USA N Amer. 48.207 -122.625 4 9.8 597 0.521 440.24 20.67 
36 spin.us* USA N Amer. 38.136 -84.501 4 12.5 1140 0.261 425.16 9.53 
37 trel.us* USA N Amer. 40.075 -88.829 3 11.0 982 0.273 1976.39 4.22 
38 tyso.us USA N Amer. 38.519 -90.565 2 12.5 997 0.261 599.34 10.33 
39 unc.us* USA N Amer. 36.008 -79.020 4 14.7 1162 0.162 427.24 11.73 
40 valm.ch* Switz. Europe 46.631 10.372 4 0.3 1098 0.498 253.20 26.85 


Nutrient Network sites with 1 to 4 years of experimental manipulations of both nutrient supply and herbivore access to plots. Control plot mass and control plot richness, the mean total biomass and mean number 
of species per m? in un-manipulated plots across all sample years; experiment years, the number of years of experimental data collected at each site for the current analyses; mean soil %N, the average percentage 
of nitrogen in soil of the site (across all plots, measured before establishment of experimental treatments); MAP is mean annual precipitation; and MAT is mean annual temperature. Lat. and Long. provide 
geographic coordinates of each site. 

* Sites used in the main text analyses (year 3 measurements of richness, biomass and photosynthetically active radiation (PAR)). 
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Extended Data Table 2 | Statistical model for treatment effects on richness after 3 years of treatment (n = 29) as a function of fertilization by 
N, P and K and micronutrients, herbivore exclusion by fencing, and their interaction 


Value Std.Error DF tvalue p-value 
(Intercept 11.965. 1.085 264 11.031  <0.0001 
fertFERTILIZED -1.987 0.391 264 + -5.075 —<0.0001 
excloseFENCE 0.321 0.397 264 -0809 0.4194 
fertFERTILIZED:excloseFENCE 0.166 0.561 264 -0296 0.7677 


Linear mixed-effects model was fit by maximum likelihood. Random effects in model were ‘site’ (s.d. = 5.60) and ‘block within site’ (s.d. = 1.22). The intercept is the estimated mean value of the control plots (no 
fence, no nutrients). This model is shown in Fig. 2a. 
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Extended Data Table 3 | Statistical model for treatment effects on biomass after 3 years of treatment (n = 29) as a function of fertilization by 
N, P and K and micronutrients, herbivore exclusion by fencing, and their interaction 


Value Std.Error DF _ t-value p-value 
(Intercept) 5.803 0.143 264 40.497 <0.0001 
fertFERTILIZED 0.379 0.061 264 6.235 <0.0001 
excloseFENCE 0.027 0.062 264 0.438 0.6619 
fertFERTILIZED:excloseFENCE 0.113 0.087 264 1.296 0.1961 


Linear mixed-effects model was fit by maximum likelihood. Random effects in model were ‘site’ (s.d. = 0.73) and ‘block within site’ (s.d. = 0.20). The intercept is the estimated mean value of the control plots (no 
fence, no nutrients). This model is shown in Fig. 2b. 
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Extended Data Table 4 | Statistical model for treatment effects on proportion of photosynthetically active radiation (PAR) reaching ground 


level after 3 years of treatment (n = 29) as a function of fertilization by N, P and K and micronutrients, herbivore exclusion by fencing, and their 
interaction 


Value Std.Error DF t-value p-value 
(Intercept) 0.399 0.046 264 8.615 <0.0001 
fertFERTILIZED -0.109 0.023 264 -4.850 <0.0001 
excloseFENCE -0.060 0.023 264 -2.614 0.0095 


fertFERTILIZED:excloseFENCE -0.009 0.032 264 -0.290 0.7721 


Linear mixed-effects model was fit by maximum likelihood. Random effects in model were ‘site’ (s.d. = 0.23) and ‘block within site’ (s.d. = 6.24 x 10 ~®) The intercept is the estimated mean value of the control 
plots (no fence, no nutrients). This model is shown in Fig. 2c. 
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Extended Data Table 5 | Statistical model for biomass effects on ground-level proportion of photosynthetically active radiation (PAR) after 
3 years of treatment (n = 29) as a function of total plot-scale biomass 


Estimate Std.Error z-value Pr(>|z 
(Intercept) -0.584 0.395 -1.479 0.1391 
fertFERTILIZED -1.108 0.361 -3.073 0.0021 
excloseFENCE -0.993 0.363 -2.736 0.0062 
fertFERTILIZED:excloseFENCE 0.166 0.551 0.302 0.7626 


Generalized linear mixed-effects model with logit link and binomial errors was fit by maximum likelihood. Random effects in model were ‘site’ (s.d. = 1.68) and ‘block within site’ (s.d. = 1.06 x 10-5). The intercept 
is the estimated mean value of the control plots (no fence, no nutrients). This model is shown in Fig. 2d. 
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Extended Data Table 6 | Effects of climate, nitrogen deposition, soil nitrogen and site productivity on change in ground-level light across 
experimental fencing treatments after 3 years of treatment 


Adjusted Relative 
Estimate Std. Error SE zvalue_p-value Importance 
(Intercept) -0.30123 0.07368 0.07827 3.849 <0.001 1.00 
Change in mass across fence -0.56329 0.19684 0.20635 2.730 0.006 0.98 
Annual Precipitation Variability 0.29938 0.16493 0.17463 1.714 0.086 0.51 
Wet Quarter Temperature -0.42281 0.28903 0.29826 1.418 0.156 0.40 
Mean Annual Temperature -0.51396 0.46975 0.48080 1.069 0.285 0.36 
Maximum Temperature 0.53099 0.35922 0.37327 1.423 0.155 0.35 
Minimum Temperature -0.28435 0.43556 0.44519 0.639 0.523 0.29 
Dry Quarter Temperature -0.14111 0.54170 0.55158 0.256 0.798 0.18 
Annual Temperature Variability 0.12017 0.39288 0.40263 0.298 0.765 0.17 
Annual N-Deposition Rate -0.14619 0.18187 0.19180 0.762 0.446 0.08 
Rainfall Potential Evapotranspir. -0.06077 0.22652 0.23682 0.257 0.797 0.06 
Net primary production in Y3 -0.05213 0.16746 0.17748 0.294 0.769 0.04 


Summary results of change in site-level means of ground-level light after model averaging; all factors are normalized. Soil nitrogen was included in the original models, but was never significant so was dropped 


from final models because of missing values. 
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Extended Data Table 7 | Effects of climate, nitrogen deposition, soil nitrogen and site productivity on site-level mean biomass change across 
experimental fencing treatments after 3 years of treatment 


Adjusted Relative 
Estimate Std. Error SE zvalue p-value Importance 
(Intercept) 0.12111 0.04966 0.05247 2.308 0.021 1.00 
Dry Quarter Temperature -0.66911 0.26088 0.27013 2.477 0.013 1.00 
Annual Temperature Variability -0.58513 0.28798 0.29564 1.979 0.048 0.63 
Annual Precipitation Variability -0.10096 0.13851 0.14446 0.699 0.485 0.63 
Minimum Temperature 0.65583 0.29851 0.30830 2.127 0.033 0.52 
Wet Quarter Temperature -0.28870 0.29658 0.30660 0.942 0.346 0.29 
Mean Annual Temperature 0.34511 0.40053 0.40959 0.843 0.400 0.29 
Net primary production in Y3 -0.11616 0.10543 0.11130 1.044 0.297 0.15 
Mean Annual Precipitation 0.01959 0.11602 0.12206 0.160 0.873 0.10 
Rainfall Potential Evapotranspir. -0.06785 0.16203 0.16898 0.402 0.688 0.09 
Annual N-Deposition Rate -0.09961 0.14467 0.15200 0.655 0.512 0.09 


Summary results of site-level means of biomass off-take after model averaging; all factors are normalized. Soil nitrogen was included in the original models, but was never significant so was dropped from final 
models because of missing values. 
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Extended Data Table 8 | Effects of climate, nitrogen deposition, soil nitrogen, site productivity, and change in light on change in site-level 
mean plant species richness across experimental fencing treatments after three years of treatments 


Adjusted Relative 
Estimate _Std. Error SE zvalue_p-value_Importance 
(Intercept) -0.05642 0.03317 0.03511 1.607 0.108 1.00 
Change in light across fence 0.21878 0.08325 0.08734 2.505 0.012 1.00 
Minimum Temperature 0.16634 0.20331 0.20831 0.798 0.425 0.21 
Maximum Temperature 0.15296 0.14054 0.14489 1.056 0.291 0.19 
Change in mass across fence -0.0941 0.08964 0.09495 0.991 0.322 0.13 
Annual Temperature Variability -0.0571 0.26403 0.26864 0.213 0.832 0.13 
Rainfall Potential Evapotranspir. -0.2268 0.32134 0.32656 0.695 0.487 0.13 
Mean Annual Temperature 0.17278 0.18625 0.19128 0.903 0.366 0.11 
Annual N-Deposition Rate -0.063 0.08599 0.09065 0.695 0.487 0.08 
Mean Annual Precipitation 0.18256 0.28651 0.29187 0.625 0.532 0.07 
Annual Precipitation Variability 0.05612 0.07807 0.08242 0.681 0.496 0.06 
Dry Quarter Temperature 0.05108 0.12828 0.13300 0.384 0.701 0.06 
Wet Quarter Temperature 0.03975 0.07086 0.07498 0.530 0.596 0.04 
Net primary production in Y3 0.04017 (0.07414 0.07845 0.512 0.609 0.04 


Summary results of change in site-level means of plant richness after model averaging; all factors are normalized. Soil nitrogen was included in the original models, but was never significant so was dropped from 


final models because of missing values. 
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Studies of experimental grassland communities” have demon- 
strated that plant diversity can stabilize productivity through spe- 
cies asynchrony, in which decreases in the biomass of some species 
are compensated for by increases in others’*. However, it remains 
unknown whether these findings are relevant to natural ecosystems, 
especially those for which species diversity is threatened by anthro- 
pogenic global change*". Here we analyse diversity-stability rela- 
tionships from 41 grasslands on five continents and examine how 
these relationships are affected by chronic fertilization, one of the 
strongest drivers of species loss globally’. Unmanipulated commu- 
nities with more species had greater species asynchrony, resulting in 
more stable biomass production, generalizing a result from bio- 
diversity experiments to real-world grasslands. However, fertiliza- 
tion weakened the positive effect of diversity on stability. Contrary 
to expectations, this was not due to species loss after eutrophication 
but rather to an increase in the temporal variation of productivity in 
combination with a decrease in species asynchrony in diverse com- 
munities. Our results demonstrate separate and synergistic effects 
of diversity and eutrophication on stability, emphasizing the need 
to understand how drivers of global change interactively affect the 
reliable provisioning of ecosystem services in real-world systems. 
Rapid declines in plant diversity have prompted concern over the 
consequences for the stability of ecosystem functioning and the reliable 
provisioning of ecological services”'*’’. The first attempts to address this 
concern used observational studies of natural variation in diversity'*"° 
and were limited in their ability to separate effects of diversity from 
other confounding factors’®. In response, more recent studies have 
directly manipulated diversity in experimentally established commu- 
nities (that is, biodiversity experiments) to assess its impact on eco- 
system functioning, particularly above-ground net primary production 
(ANPP)'’. Numerous biodiversity experiments have shown that greater 
species diversity promotes a greater stability of productivity over time**®, 
with asynchronous response of species to environmental fluctuations as 
an important underlying mechanism’. There would be no stabilizing 


effect if species fluctuated in perfect synchrony. However, asynchrony 
in species response to environmental fluctuations causes declines in 
the biomass of some species to be compensated for by increases in others, 
thus buffering temporal fluctuation in the productivity of the whole 
community. Species asynchrony can increase the stability of aggregate 
functions in species-rich communities, because compensatory effects are 
more likely to occur when the species pool is larger and more diverse’””. 

Biodiversity experiments, in turn, have their own limitations, and their 
relevance to natural grassland ecosystems is debated'""*"°. For example, 
experimental gradients of diversity are usually assembled randomly from 
a local species pool, whereas in natural systems composition and diversity 
are influenced by a variety of factors including nutrient availability, 
climatic conditions and anthropogenic land use”""’. It is also likely that 
diversity is not the only, or even the primary, driver of the stability of 
ANPP”; however, few experiments have simultaneously addressed 
changes in both biodiversity and other aspects of global change. 

In natural grasslands the situation is likely to be complex, because 
anthropogenic impacts such as climate change and eutrophication are 
likely to change diversity—with potential consequences for stability— 
as well as having their own direct effects on stability*’*”**. In particular, 
anthropogenic increases in nutrient inputs into grasslands (through 
direct organic and inorganic fertilization and atmospheric deposition) 
affect the structure and functioning of natural ecosystems worldwide**”. 
For instance, nutrient enrichment usually increases productivity and 
reduces plant diversity****. However, the effect of eutrophication on 
the stability of productivity in natural grasslands remains unclear. On 
the basis of theory and results limited to single-site experiments”, 
we expect eutrophication to reduce the stability of productivity, because 
the well-known negative effects of nutrient enrichment on diversity*** 
could in turn reduce species asynchrony and stability’®'*”?"*. However, 
eutrophication may have additional impacts on stability that are inde- 
pendent of any changes in diversity. The temporal stability of ANPP is 
the ratio of the temporal mean to the temporal standard deviation, so an 
increase in stability can result from an increase in the mean, a decrease 
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in the standard deviation, or both. Because eutrophication is expected 
to increase productivity it may have a stabilizing effect by increasing 
the temporal mean. However, there is also the potential for effects of 
eutrophication on stability through changes in the temporal standard 
deviation, but these are less well understood. We therefore require a 
better picture of how drivers of global change affect ecosystem stability 
both through changes in diversity and through other routes. Here we 
compare the relationship between diversity and stability found in grass- 
land biodiversity experiments with those in fertilized and unfertilized 
plots in natural grasslands. We also assess the effects of eutrophication 
on the diversity-stability relationship both through changes in diver- 
sity and through other routes. 

We evaluated the relationships between species diversity, species 
asynchrony and stability of ANPP across 41 naturally assembled grass- 
land ecosystems on five continents (Extended Data Fig. 1 and Extended 
Data Table 1), using data from the Nutrient Network (NutNet; http:// 
www.nutnet.org) collaborative experiment’””*. We used standardized 
methods to assess plant diversity and ANPP at each site in both unma- 
nipulated controls and experimentally fertilized plots in a well-replicated 
design. We quantified diversity as the average plant species richness in 
standard 1-m” plots over a three-year period. Stability can take a variety 
of meanings in the ecological literature’; here we focus on temporal 
stability of community-level, above-ground live plant biomass from 
all species in a plot (a measure of ANPP) over three years. We define 


temporal stability for each plot as the temporal mean of ANPP divided 
by its temporal variability—that is, the temporal standard deviation 
over a common period (see Methods). 

Stability of ANPP was positively associated with plant diversity in 
the unmanipulated communities (Fig. 1a). Using a hierarchical sam- 
pling design and statistical model we found that stability increased with 
diversity consistently within and among sites, resulting in parallel rela- 
tionships (coloured and black lines, respectively, in Fig. 1a). The con- 
sistent relationship between diversity and stability is concordant with 
experimental results obtained in grasslands across Europe’ and with 
experiments and observations at single locations*’*?!”°. We used mul- 
tiple regression to evaluate the influence of plant diversity and key biotic 
and abiotic factors on stability in our 41 grasslands. Stability was still 
associated with diversity after using covariates to control for differences in 
average site productivity and climatic conditions including annual trends, 
seasonality and extreme or limiting environmental factors (Extended 
Data Tables 1 and 2). Together these results demonstrate that temporal 
stability of ANPP was positively related to variation in plant diversity in 
our 41 naturally assembled grassland ecosystems. 

We determined the role of species asynchrony as a mechanism pro- 
moting stability, by using a community-wide measure that allowed direct 
comparison between communities with different numbers of species’”-”’. 
Because the biomass of individual plant species was available at few 
sites, we used estimates based on our three-year record of the percentage 
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Figure 1 | Relationships of temporal stability of ANPP (upper row) and 
species asynchrony (lower row) with species diversity. a-d, Unmanipulated 
(a, b) and fertilized (c, d) communities of the Nutrient Network. e, f, The 
BIODEPTH network of grassland biodiversity experiments. Relationships of 
temporal stability of ANPP (temporal mean/temporal standard deviation; 
natural log transformed for analysis) of 41 grassland sites of the Nutrient 
Network were positive in the unmanipulated communities (a, b) (slopes and 
95% confidence intervals: 0.028 (0.006 to 0.050) and 0.060 (0.023 to 0.097), but 
not detectible in the fertilized communities (c, d) (—0.001 (—0.025 to 0.022) 
and 0.008 (—0.031 to 0.047)). (e, f) Relationships in the BIODEPTH network 
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were positive (0.018 (0.003 to 0.039) and 0.073 (0.053 to 0.093)). Species 
asynchrony varied from zero (perfect synchrony) to one (perfect asynchrony). 
Species richness values for the Nutrient Network are average values over the 
three years of post-treatment data. Points are values for individual plots 

(n = 117 for Nutrient Network, n = 480 for BIODEPTH). Black lines are the 
back-transformed fixed-effect linear regression slopes between sites from the 
mixed-effects model; coloured lines show patterns within sites. Dashed lines 
show regression slopes between sites in the unmanipulated communities of the 
Nutrient Network. Colours correspond to the ‘colour code’ column in 
Extended Data Table 1. 
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cover of individual plant species in each plot (see Methods). Our ana- 
lysis of potential stability mechanisms showed that species asynchrony 
was positively related to plant diversity (Fig. 1b) and stability (Fig. 2a), 
consistent with theory on the stabilizing effects of species asynchrony 
in species-rich communities. Greater stability at higher diversity can also 
result from an increase in the temporal mean of ANPP with diversity (a 
‘performance-enhancing effect’ that results in a higher ratio of the 
temporal mean to the temporal variation)’"”. Consistent with earlier 
NutNet analyses”’, we found that the temporal mean of ANPP was not 
related to plant diversity (Fig. 3a). Although it is an indirect test, our 
result provides no support for a performance-enhancing effect in stabi- 
lizing higher-diversity communities in our study. Instead, we found 
stronger support for a decrease in temporal variation of ANPP (mea- 
sured by the standard deviation) with diversity (Fig. 3a). In other words, 
greater stability at higher diversity resulted because diversity decreased 
the temporal variation of ANPP relative to its mean, resulting in a more 
stable mean-to-variance ratio. 

To compare our results from naturally assembled grasslands with results 
from artificially assembled biodiversity experiments, we calculated values 
of species asynchrony from the BIODEPTH experiment’. BIODEPTH— 
a pan-European network of grassland biodiversity experiments—was 
conducted at eight field sites with a comparable hierarchical design, 
plot size and measurements (see Methods). Our results are comparable 
to those from BIODEPTH because both studies use the same three- 
year experimental duration and cover a similar range of diversity levels 
(although, by design, biodiversity experiments feature many more low- 
diversity communities than observational surveys). We found that the 
sign and slope of the overall relationships between diversity and stability 
and between diversity and asynchrony from our global multisite study 
were comparable to those from the BIODEPTH network of grassland 
biodiversity experiments (Fig. le, f} compare the solid and dashed lines). 

We tested the impact of eutrophication on temporal stability and 
species asynchrony by using data from NutNet plots that were fertilized 
for three years with a combination of nitrogen, phosphorus, potassium 
and micronutrients (see Methods). Fertilization weakened the positive 
effect of diversity on stability and species asynchrony (Fig. 1c, d; com- 
pare the solid and dashed lines). We expected this result on the basis of 
theory, because nutrient enrichment often reduces diversity**”’, which 
could in turn reduce species asynchrony and stability’*'*”*”°. However, 
although fertilization reduced diversity by an average of 1.3 species 
(95% confidence interval 0.7-1.9) per site (corresponding toa reduction 
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of diversity from 2.0% to 16.9% relative to average levels in the control 
plots ranging from 4.4 to 32.3 species per square metre (Extended Data 
Table 1)), counter to expectations this loss of diversity did not lead toa 
reduction of stability through a decrease in species asynchrony (Extended 
Data Fig. 2). Instead, the lower slope of the diversity—stability relationship 
in the fertilized communities (Fig. 1c) can be explained by a combina- 
tion of two factors. First, fertilization increased the temporal variation 
of ANPP in diverse communities in comparison with unmanipulated 
communities (Fig. 3a, b; compare the dashed lines). Because fertilization 
generally increased mean productivity in comparison with unmanipu- 
lated communities (Fig. 3a, b; compare the solid lines), this increased 
variation weakened the positive effect of diversity on stability in com- 
parison with unmanipulated communities. Second, fertilization resulted 
in a decrease in species asynchrony in diverse communities compared 
with unmanipulated communities (Fig. 1d; compare the dashed and 
solid lines). Because fertilization did not alter the positive relationship 
between species asynchrony and stability (Fig. 2b), this decrease in 
species asynchrony resulted in decreased stability in diverse commu- 
nities compared with unmanipulated communities. In total, the results 
of our fertilization experiment did not show the expected destabilizing 
effects of diversity loss. Instead, eutrophication affected stability directly 
through a combination of diversity-dependent effects on species asyn- 
chrony and on the temporal variation of productivity. These direct effects 
of eutrophication on the diversity-stability relationship could not have 
been predicted from studies of natural or experimental diversity gradients. 

The results of our observational study of naturally assembled grass- 
land communities are consistent with a stabilizing effect of asynchronous 
responses of species to environmental fluctuations in more diverse plant 
communities—a result previously restricted to biodiversity experiments’ 
and observational studies at single locations”’”°. However, the drivers 
of global change causing a loss of diversity may have additional effects 
on stability. The results of our fertilization experiment demonstrate 
impacts on stability that were not caused by changes in diversity but 
came about through effects of eutrophication on both the temporal 
variation in production and on species asynchrony. However, although 
the effects of fertilization on stability were not caused by species loss, the 
changes in species asynchrony and temporal variation that were respon- 
sible were both affected by levels of community diversity. Predicting the 
effects of drivers of global change therefore requires a better understanding 
of both their direct effects on ecosystem stability and their indirect 
effects through changes in diversity. Our results indicate that although 
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Figure 2 | Relationships between temporal stability of ANPP (natural log 
transformed) and species asynchrony in 41 grassland sites of the Nutrient 
Network. a, Unmanipulated communities; b, fertilized communities. The 
temporal stability was greatest in plots in which species fluctuations were 
asynchronized in both the unmanipulated (slope and 95% confidence intervals: 


1.93 (1.70 to 2.16)) and fertilized communities of 41 grassland sites of the 
Nutrient Network (1.90 (1.58 to 2.21)). Points are values for individual plots 
(n = 117). Colours correspond to the ‘colour code’ column in Extended Data 
Table 1. 
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Figure 3 | Relationships of temporal mean and standard deviation of ANPP 
(natural log transformed) to species diversity in 41 grassland sites of the 
Nutrient Network. a, Unmanipulated communities; b, fertilized communities. 
Temporal mean (,u; solid line and filled symbols) was not related to species 
diversity (slope and 95% confidence intervals: 0.01 (—0.02 to 0.03)) in the 
unmanipulated communities (a), and standard deviation (o; dashed line and 
open symbols) was negatively related to species diversity (—0.03 (—0.05 to 
—0.01)), suggesting that greater stability at higher diversity in the 


eutrophication is intended to increase average levels of productivity it 
can also affect its temporal stability. Sustainable management of grass- 
land ecosystems therefore requires a better understanding of the com- 
plex interrelationships between diversity, productivity and stability 
and how they are affected by fertilization. 


METHODS SUMMARY 


The 41 sites are part of the Nutrient Network Global Research Cooperative’* (Extended 
Data Fig. 1 and Extended Data Table 1) (see NutNet’s website). Experimental plots 
included untreated controls and plots with nitrogen, phosphorus and potassium 
and micronutrients added in combination (NPK). The analyses presented here 
include all sites with the first three years of post-treatment community-level ANPP 
(gm yr‘) and species-level ANPP estimates based on percentage cover. 
Wealso examined data from BIODEPTH, a consortium of coordinated biodiversity 
experiments that manipulated plant diversity at eight European grassland sites'. We 
analysed community and species-level ANPP for the three main years of this project*. 
Ecosystem temporal stability was defined for each plot as ji/o, where ju is the tem- 
poral mean of ecosystem-level ANPP and a its temporal standard deviation over 
the three-year period. Species asynchrony was measured for each plot as 1 — Op; 


where @, is species synchrony and is calculated as 1—g, =1 


7 
(oH ci) 
where o; is the temporal standard deviation of species i in a plot with S species 
over the three years’*. Thus, stability and species asynchrony are related such that 
higher levels of species asynchrony are associated with greater stability of the 
community as a whole”. 

We modelled relationships with linear mixed-effects models by using the Ime 
function from the nlme library in R2.15.1. To improve normality, the ecosystem 
temporal stability and community-wide species asynchrony were log-transformed 
before analyses. Sites and blocks nested within sites for the NutNet data and sites 
and species composition nested within sites for BIODEPTH were treated as ran- 
dom effects, allowing both the intercepts and slopes of regression versus diversity 
to vary between sites if supported by model selection. 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Trogocytosis by Entamoeba histolytica contributes to 
cell killing and tissue invasion 


Katherine S. Ralston!, Michael D. Solga’, Nicole M. Mackey-Lawrence’, 


Entamoeba histolytica is the causative agent of amoebiasis, a poten- 
tially fatal diarrhoeal disease in the developing world. The parasite 
was named “histolytica” for its ability to destroy host tissues, which 
is probably driven by direct killing of human cells. The mechanism 
of human cell killing has been unclear, although the accepted model 
was that the parasites use secreted toxic effectors to kill cells before 
ingestion’. Here we report the discovery that amoebae kill by ingest- 
ing distinct pieces of living human cells, resulting in intracellular 
calcium elevation and eventual cell death. After cell killing, amoebae 
detach and cease ingestion. Ingestion of human cell fragments is re- 
quired for cell killing, and also contributes to invasion of intestinal 
tissue. The internalization of fragments of living human cells is remi- 
niscent of trogocytosis (from Greek trogo, nibble) observed between 


Figure 1 | Amoebae internalize human cell fragments, preceding human 
cell death. a, DiD and CMFDA-labelled human Jurkat cells (H); an amoeba 
(A) internalizes human cell fragments (arrows) over time. Images are 
representative of three independent experiments. b, DiD and JC-1-labelled 
human Jurkat cells; mitochondria (arrow) are ingested by the amoeba in an 
ingested fragment. Images are representative of six independent experiments. 
c, Electron microscopy with gold-labelled human Jurkat cells. Inset (top left), 
gold-labelled human cell membrane (gold, circles). Inset (right), fragments 
(arrows) within the amoeba. Images are representative of two independent 
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immune cells”, but amoebic trogocytosis differs because it results 
in death. The ingestion of live cell material and the rejection of corpses 
illuminate a stark contrast to the established model of dead cell clear- 
ance in multicellular organisms’. These findings change the model 
for tissue destruction in amoebiasis and suggest an ancient origin of 
trogocytosis as a form of intercellular exchange. 

Amocebiasis is prevalent in the developing world, and in the first year 
of life, approximately one third of infants in an urban slum in Dhaka, 
Bangladesh are infected with E. histolytica*. Malnourishment and stunt- 
ing are associated with repeated infections in children’. During infection, 
amoebae colonize the colon, which can be asymptomatic or result in 
diarrhoea, colitis or extraintestinal disease. Invasive disease is charac- 
terized by profound tissue destruction, manifesting as massive intestinal 


JC-1/DiD 


experiments. d, DiD and Fluo-4-labelled human Jurkat cells; 

with SYTOX blue present during imaging. Arrows, amoebic trogocytosis 
(1:30), intracellular calcium concentration elevation (2:30) and membrane 
permeabilization (9:30). Images are representative of 15 independent 
experiments. e, Timing in 60 cells from 15 independent experiments; shown are 
data points, means and standard deviations; P values from t-tests: *P < 0.05, 
**P < 0.01, ***P < 0.001. Scale bars, 10 jum (a, b, d), 2 tum (b, insets), 0.5 zm 
(c, top left, right) and 5 1m (c, bottom left). 
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USA. ?School of Life Sciences, Jawaharlal Nehru University, 110067 New Delhi, India. Department of Pathology, University of Virginia, Charlottesville, Virginia 22908, USA. 
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ulceration or fatal extraintestinal abscesses. This is probably driven by 
the cell killing activity of the amoebae, which are potently cytotoxic and 
capable of killing a variety of cell types including epithelial cells, immune 
cells and many tissue culture lines. Contact mediated by the amoeba sur- 
face galactose- and N-acetyl-p-galactosamine-binding (Gal/GalNAc) 
lectin is critical, but the molecular mechanism by which cells are killed 
is unknown. The established model is that the amoebae first kill cells 
and then ingest dead cell corpses’’° (Extended Data Fig. 1a). 

To elucidate the mechanism for human cell killing, we used live micros- 
copy to examine host-parasite interactions directly. Surprisingly, we found 
that, immediately after human cell contact, the amoebae internalized 
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distinct fragments of human cell membrane (Fig. la). This was ini- 
tiated within one minute of human cell contact, and amoebae contin- 
ued to ingest additional fragments thereafter (Supplementary Video 1). 
By pre-labelling human cellular compartments, we found that ingested 
membrane fragments contained human cell cytoplasm 90% of the time 
(Fig. 1a) and mitochondria 31% of the time (Fig. 1b). Pre-labelling of 
human cells with biotin and streptavidin-gold before co-incubation with 
amoebae further demonstrated that the ingested fragments were human- 
derived (Fig. 1c and Extended Data Fig. 2a). There was pronounced actin 
polymerization within the amoebae at the site of human cell contact and 
ingested fragments were surrounded by polymerized actin (Extended Data 


Figure 2 | Amoebic trogocytosis is predominant 
and specific to live human cells. Imaging flow 
cytometry with CMFDA-labelled amoebae 

(A, blue), DiD-labelled human Jurkat cells 

(H, yellow) and live/dead violet-labelled dead cells 
(purple). a, Detection of internalized human 
material. b, Measurement of fragmentation of 
internalized human material, gated from low to 
high. c, Example merged CMFDA, DiD and bright 
field images. Images are representative of 10,000 
images collected for each replicate. d, Detection of 
dead cells not attached to amoebae. e, Example 
images; live/dead violet (left panels) and merged 
CMEDA, DiD and bright field (right panels). 
Images are representative of 10,000 images 
collected for each replicate. f, Detection of dead 
cells attached to amoebae. g, Example images, as in 
e. Images are representative of 10,000 images 
collected for each replicate. h-k, Means and 
standard deviations for biological replicates 
(10,000 events/each). P values from t-tests: 

*P< 0.05, **P < 0.01, ***P< 0.001. 


24 APRIL 2014 | VOL 508 | NATURE | 527 


LETTER 


Fig. 2b, c). Ingested fragments had two membranes, with an inner human 
cell membrane and an outer amoeba membrane (Extended Data Fig. 2a, d). 

Another unexpected finding was that human cells were alive when 
ingestion of fragments initiated. Irreversible intracellular calcium eleva- 
tion occurred in human cells within the first few minutes, typically fol- 
lowing detectable amoebic trogocytosis (Fig. 1d, e and Supplementary 
Video 2). Despite the fact that numerous fragments were physically ex- 
tracted from human cells by the amoebae, human cells initially retained 
plasma membrane integrity (Extended Data Fig. 3a). However, human 
cells were eventually killed as evidenced by loss of plasma membrane 
integrity’ (Fig. 1d, e, Extended Data Fig. 3b and Supplementary Video 1), 
degradation of nuclear DNA and loss of mitochondrial potential (Ex- 
tended Data Fig. 4a, b). Interestingly, ingestion of fragments ceased 
once cells were dead (Extended Data Figs 1b and 3b and Supplemen- 
tary Video 1), at which point the amoebae detached from corpses. Inges- 
tion of fragments required living cells, because pre-killed human cells 
were ingested intact (Extended Data Figs 1c and 4c, d). The interna- 
lization of fragments of living cells is reminiscent of trogocytosis**" 
(from Greek trogo, nibble); hence we refer to the E. histolytica process 
as ‘amoebic trogocytosis’. 

To quantify amoebic trogocytosis and cell death at the population 
level, we used imaging flow cytometry (Fig. 2 and Extended Data Fig. 5). 
We found that internalization ofhuman cellular material increased from 
63.1% at 5 min to 92.4% at 40 min (Fig. 2a, h). To determine whether 
internalized material represented whole cells or fragments, we measured 
the fragmentation of the human cells (Fig. 2b, c, i,j). The population re- 
presenting high fragmentation (numerous ingested human cell fragments) 


increased over time (Fig. 2b, j), from 6.1% at 5 min to 69.8% at 40 min. 
In contrast, the population representing low fragmentation (few ingested 
human cell fragments), decreased over time (Fig. 2b, i). Thus, population- 
level quantification demonstrated that amoebic trogocytosis was pre- 
dominant. The majority of dead human cells were not attached to 
amoebae (Fig. 2d, e, k), and very few remained attached to amoebae 
(Fig. 2f, g, k), supporting our observation that amoebae cease trogo- 
cytosis once cells have been killed (Extended Data Fig. 3b and Sup- 
plementary Video 1). 

Neither amoebic trogocytosis nor cell killing occurred when cells were 
artificially brought together by centrifugation and co-incubated at 4 °C 
instead of 37 °C (Extended Data Figs 6a and 7f), in agreement with the 
known inhibition of amoebic cytotoxicity at 4 °C’. Pharmacological 
inhibition of amoebic phosphatidylinositol 3-kinase (PI3K) signalling 
using wortmannin also inhibited amoebic trogocytosis and cell death 
(Extended Data Fig. 6g, j). Interference with the dynamics of microfila- 
ment formation using cytochalasin D inhibited both amoebic trogocy- 
tosis and cell death (Extended Data Fig. 7a-f). Tetracycline-inducible 
overexpression of a kinase-dead point mutant of the amoebic C2 domain- 
containing protein kinase (EhC2PK)* reduced both amoebic trogocytosis 
and cell death (Fig. 3). EnC2PK is an early regulator of phagocytosis in 
E, histolytica, thus some initiators of phagocytosis may be common 
to amoebic trogocytosis. To examine signalling upstream of EhC2PK, 
we used blocking antibodies to interfere with the amoebic Gal/GalNAc 
lectin, which has been shown to havea role in initiating cell killing’**’° 
separable from its role in host-cell attachment’*. An anti-Gal/GalNAc 
lectin blocking monoclonal antibody directed to epitope 1, which reduces 


Figure 3 | Amoebic trogocytosis contributes to 


human cell killing. a-d, Imaging flow cytometry 
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analysis with amoebae expressing a tetracycline 
(Tet)-inducible copy of EnC2PK K179A. 

a, Measurement of fragmentation. b, Detection of 
dead cells. c, d, Means and standard deviations for 
biological duplicates at each time point (10,000 
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killing while enhancing attachment”, inhibited both amoebic trogocytosis 
and cell death compared to a control monoclonal antibody directed to 
epitope 3 (Extended Data Fig. 7g-l). Thus amoebic trogocytosis contri- 
butes to human cell killing and requires physiological temperature, actin 
rearrangements and Gal/GalNAc lectin, EnC2PK and PI3K signalling. 
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The hallmark of amoebiasis is the detection of amoebae with ingested 
red blood cells'’. Live imaging demonstrated that amoebic trogocytosis 
occurred during red blood cell ingestion (Fig. 4a, b and Supplementary 
Videos 3 and 4). We also detected amoebic trogocytosis of human colonic 
epithelial cells (Extended Data Fig. 8). Intriguingly, amoebic trogocytosis 
occurred when amoebae were incubated with ex vivo mouse intestinal 
tissue and monitored live in three dimensions using two-photon micros- 
copy (Fig. 4c and Supplementary Videos 5 and 6). Amoebae traversed 
the intestinal crypts, as has been demonstrated in studies using ex vivo 
human intestine’*. Tissue cell death occurred and amoebic trogocytosis 
of intestinal enterocytes preceded amoebic invasion into the tissue (Ex- 
tended Data Fig. 9 and 10 and Supplementary Videos 7 and 8), further 
supporting its pathologic relevance. Moreover, inhibition of amoebic 
trogocytosis by treatment with wortmannin or cytochalasin D signifi- 
cantly blocked tissue invasion (Fig. 4d). Further supporting the rele- 
vance of amoebic trogocytosis to pathogenesis, we found that amoebae 
that had previously undergone amoebic trogocytosis were primed to 
undergo more ingestion and more cell killing (Fig. 4e-g) than amoebae 
that had not undergone trogocytosis. More of the primed amoebae under- 
went trogocytosis (Fig. 4e), although they ingested the same number of 
fragments as non-primed amoebae (Fig. 4f), indicating that priming 
enhances the initiation of amoebic trogocytosis. 

Therefore several lines of evidence demonstrate that amoebic tro- 
gocytosis contributes to cell killing and is likely to contribute to patho- 
genesis in vivo. Cell death probably stems from the accumulation of 
membrane damage. Targeted cells initially retain membrane integrity, 
hence ingestion of one fragment is not enough to kill. Inhibition experi- 
ments demonstrated that a reduction in the number of ingested frag- 
ments almost completely prevented cell death, indicating that there is 
a threshold of tolerated damage. It has previously been reported that amoe- 
bae ingest killed cells’®, but in these studies the target cells were pre- 
killed and probably had different characteristics from cells killed directly 
by the amoebae. Consistent with this idea, we found that pre-killed cells 
were internalized whole and thus they were recognized differently than 
endogenously killed cells, which the amoebae ceased ingesting. 

The rejection of dead cell corpses is remarkable and makes a nutri- 
tional role for amoebic trogocytosis seem unlikely. Intestinal bacteria 
may be the primary food source during infection and the energetic costs 
of full internalization of human cells may outweigh the benefits. The 
large size of epithelial cells and the tight intercellular connections may 
make ingestion of entire cells difficult. Amoebic trogocytosis could poten- 
tially lower tissue density and create a more spacious environment 
for amoeboid migration. Ingesting fragments could also provide the 
amoebae with a means of sensing the environment. Interestingly, amoe- 
bae that had undergone amoebic trogocytosis were primed to undergo 
more trogocytosis, suggesting that there might be ‘feed-forward’ regu- 
lation, similar to what has been demonstrated during the ingestion of 


Figure 4 | Amoebic trogocytosis occurs with red blood cells, contributes to 
intestinal invasion, and promotes enhanced cell killing. a, DiD-labelled red 
blood cells (RBC) and CMFDA-labelled amoebae (A); ingested fragments 
(arrows) and a fragment that remains extracellular (0:51). Scale bar, 5 um. 
Images are representative of three independent experiments. b, Example 
three dimensional reconstruction with DiD-labelled red blood cells and 
CMFDA-labelled amoebae; ingested fragments (arrows) and a fragment that 
remains extracellular (3:25). Images are representative of three independent 
experiments. c, Planes from live three-dimensional two-photon microscopy 
with mouse intestinal tissue from a mouse expressing membrane-targeted 
EGFP (yellow); calcein violet-labelled amoebae (blue) are ingesting fragments 
(arrows). Images are representative of three independent experiments. 

d, Amoebae treated with wortmannin or cytochalasin D were incubated with 
tissue. Invasion depth, means and standard deviations from biological 
replicates (at least 40 cells per treatment); P values are indicated. e-g, Amoebae 
were incubated with or without human Jurkat cells (‘primed’), before 
incubation with labelled human Jurkat cells. e, Measurement of internalization. 
f, Measurement of fragmentation. g, Detection of dead cells. Means and 
standard deviations for biological duplicates (10,000 events/each). P values 
from t-tests: *P < 0.05, **P < 0.01, ***P < 0.001. 
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beads and apoptotic cells'’. Therefore amoebic trogocytosis and destruc- 
tion of cells may promote further destruction. Further work will be 
needed to confirm with certainty that the proposed model for amoebic 
trogocytosis in cell killing and tissue invasion is applicable to patho- 
genesis in vivo, and to explore the implications of this process in disease. 
Distortion of target cell shape during amoebic ingestion has been 
reported previously”*”’. We have similarly observed distortion before 
the appearance of fragments. We used a variety of membrane labels, 
and fragments are more readily detected using these tightly localized 
and bright labels, than with more diffuse probes. It is therefore possible 
that, owing to differences in detection methods and the dynamic and rapid 
appearance of fragments, they were not previously observed. Because 
increased red blood cell rigidity reduces cell distortion during ingestion”, 
it is also conceivable that differences in the deformability of different 
cell types influence the extent of fragmentation that occurs. Less deform- 
able targets like beads are likely to be ingested whole. Finally, the size of 
the target cell type determines the amount of material that remains 
extracellular; very little remains following red blood cell ingestion, whereas 
much more remains following Jurkat or epithelial cell ingestion. 
Some mechanistic details of trogocytosis between immune cells are 
beginning to be defined”. A key difference between immune cell trogo- 
cytosis and amoebic trogocytosis is that immune trogocytosis does not 
result in cell death. This may be because immune trogocytosis involves 
the exchange of fewer fragments, and we find that targeted cells can with- 
stand the ingestion of a limited number of fragments. Interestingly, 
there are hints that protozoa like Naegleria fowleri may also be capable 
of ingesting host cell fragments’’. Therefore, trogocytosis as a form of 
intercellular exchange may be more evolutionarily ancient and wide- 
spread than is currently appreciated. Finally, amoebic trogocytosis is a 
potentially promising target for the future development of new ther- 
apeutics for amoebiasis, a major neglected disease in the developing world. 


METHODS SUMMARY 


Amoebae (HM1:IMSS), human Jurkat cells and human red blood cells were pre- 
pared as described'®™*, In some experiments, parasites were stably transfected with 
a plasmid for tetracycline-inducible expression of EnC2PK K179A™. Mice with 
membrane-targeted enhanced green fluorescent protein (EGFP) were used in 
compliance with the University of Virginia Institutional Animal Care and Use 
Committee, and PHS and USDA guidelines. For live imaging, Jurkat cells were pre- 
labelled with DiD, Dil, CMFDA and/or JC-1 (Invitrogen) and were combined with 
amoebae immediately before imaging on a Zeiss LSM 510 or LSM 700 confocal 
microscope. In some experiments, SYTOX blue (Invitrogen) was present during 
imaging. Human red blood cells were labelled with DiD. For electron microscopy, 
Jurkat cells were pre-labelled with Sulfo-NH-SS biotin (Thermo) and streptavidin- 
6nm gold (Electron Microscopy Sciences). Amoebae and Jurkat cells were com- 
bined before fixation, processing and imaging on a JEOL 1230 microscope. For 
imaging flow cytometry, amoebae were pre-labelled with CMFDA and Jurkat cells 
with sulphonated DiD. Pharmacological interventions were carried out in the amoe- 
bae after CMFDA labelling, using wortmannin or cytochalasin D (Sigma). At each 
time point, samples were transferred to ice and labelled with live/dead fixable violet 
(Invitrogen) before fixation. Samples were collected on an Amnis ImageStreamX 
or ImagestreamX Mark II flow cytometer. 10,000 to 15,000 events were collected 
for each sample and data were analysed using Amnis IDEAS. For ex vivo imaging, 
amoebae were labelled with calcein violet (Invitrogen). Mouse caeca were isolated 
and amoebae added immediately before imaging with a FLUOVIEW FV1000 
Olympus multiphoton system. Three dimensional reconstructions were made with 
Volocity (PerkinElmer). Depth of tissue invasion was quantified in a blinded fashion 
in ten non-consecutive frozen sections per treatment. Quantitative data are expressed 
as the mean and standard deviation of the mean. Statistical significance was deter- 
mined with the unpaired two-tailed Student’s t-test (Prism, GraphPad Software, Inc.). 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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in the failing heart 
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Heart failure is characterized by a debilitating decline in cardiac 
function’, and recent clinical trial results indicate that improving 
the contractility of heart muscle cells by boosting intracellular calcium 
handling might be an effective therapy’. MicroRNAs (miRNAs) 
are dysregulated in heart failure** but whether they control contrac- 
tility or constitute therapeutic targets remains speculative. Using 
high-throughput functional screening of the human microRNAome, 
here we identify miRNAs that suppress intracellular calcium hand- 
ling in heart muscle by interacting with messenger RNA encoding 
the sarcoplasmic reticulum calcium uptake pump SERCA2a (also 
known as ATP2A2). Of 875 miRNAs tested, miR-25 potently delayed 
calcium uptake kinetics in cardiomyocytes in vitro and was upre- 
gulated in heart failure, both in mice and humans. Whereas adeno- 
associated virus 9 (AA V9)-mediated overexpression of miR-25 in vivo 
resulted in a significant loss of contractile function, injection of 
an antisense oligonucleotide (antagomiR) against miR-25 markedly 
halted established heart failure in a mouse model, improving car- 
diac function and survival relative to a control antagomiR oligonu- 
cleotide. These data reveal that increased expression of endogenous 
miR-25 contributes to declining cardiac function during heart fail- 
ure and suggest that it might be targeted therapeutically to restore 
function. 

Heart failure is the culmination of diverse cardiovascular diseases, 
including hypertension, ischaemic disease and atherosclerosis, valvular 
insufficiency, myocarditis, and contractile protein mutations, and is uni- 
formly characterized by a progressive loss of contractile function and 
reserve’. The prevailing therapeutic strategy is to block the deleterious 
effects of the renin—angiotensin and sympathetic systems, but existing 
drugs target few mechanisms within the failing cardiomyocyte and there 
is a critical need for novel drugs, especially for treating patients with 
advanced heart failure®’. A complex intracellular network balances 
contractility and intracellular Ca” * handling in relation to workload®; 
however, the role of miRNAs in cardiac contractility remains largely 
unexplored. miRNAs fine tune nearly all normal and pathological pro- 
cesses by downregulating proteins that occupy key nodal points in 
biological control networks”’®. We reasoned that miRNAs that repress 
contractility'' might be upregulated during human heart failure, and 
might therefore constitute novel targets for therapeutic intervention. 

The calcium-transporting ATPase SERCA2a is the primary mech- 
anism of Ca”* uptake during excitation-contraction coupling in cardi- 
omyocytes. Impaired Ca** uptake resulting from decreased expression 
and reduced activity of SERCA2a is a hallmark of heart failure’. Accord- 
ingly, restoration of SERCA2a by gene transfer has proven effective in 
improving key parameters of heart failure in animal models’” and more 
recently in clinical trials’. Thus, miRNAs that downregulate SERCA2a 
might be elevated in heart failure and compromise cardiac function. 
Computational algorithms predict >300 miRNAs that target SERCA2a 
(miRNA Data Integration Portal’*), but they are error prone, necessitating 


empirical evaluation'*. We therefore functionally screened a whole- 
genome collection of miRNAs for selective downregulation of the Ca~* 
pump. To perform the screen, we fused the SERCA2a mRNA 3’ untrans- 
lated region (UTR) sequence downstream of an enhanced green fluor- 
escent protein (eGFP) coding region, making a ‘target sensor’ construct 
to permit detection of active miRNAs by a decrease in eGFP fluores- 
cence (Fig. la, b). In the primary screen, 144 miRNAs reduced eGFP 
fluorescence by >30% with P < 0.05 (Fig. 1c, d), and 82 were confirmed 
by testing through a dose range (Fig. le, fand Supplementary Table 1). 
Thirty-two were evolutionarily conserved and 15 were both evolutio- 
narily conserved and reported to be upregulated in human heart failure 
(see Methods) (Fig. le). Four caused a highly significant (P < 0.001, 
one-tailed analysis of variance (ANOVA)) prolongation of the decay 
phase of the Ca** transient in the cardiomyocyte HL-1 cell line, shown 
here as the duration from 75% to 25% maximal value, CaTD75_5. (Fig. 1g), 
measured using an automated platform’. The most potent miRNA was 
miR-25, which elicited a physiological effect comparable to that of the 
short interfering RNA (siRNA) directed against SERCA2a, siSERCA2a 
(Fig. 1h). We confirmed that miR-25 is upregulated in human myo- 
cardial samples from patients with severe heart failure (Fig. 1i). In situ 
hybridization revealed that miR-25 is expressed primarily in cardio- 
myocytes of trans-aortic constriction (TAC)-induced failing mouse 
hearts, with no detectable expression in cardiac fibroblasts or vascular 
endothelial cells, and low expression in vascular smooth muscle (Fig. 2). 
To establish a link between miR-25 and cardiac function, we iden- 
tified putative protein targets involved in Ca”* handling computation- 
ally (DianaLab miRPath with TargetScan Mouse v.5.0, see Methods). 
In addition to SERCA2a, these included inositol-1,4,5-trisphosphate 
receptor 1 (IP3R1), which was selectively downregulated by miR-25 tran- 
sient transfection in HL-1 cells (Fig. 3a and Extended Data Fig. 1a). 
Other candidate proteins involved in calcium handling, including the 
sodium-calcium exchanger (NCX1; also known as SLC8A1), calmodulin 
kinase 2 (CaMKII), phospholamban (PLN) and calmodulin 3 (CALM3) 
were unaffected in HL-1 cells transfected with miR-25 (Extended Data 
Fig. 2). miRNAs bind cognate mRNAs by imprecise base pairing to 
specific recognition elements”’®. Single putative miR-25 recognition 
sites were identified within SERCA2a and IP3R1 3’ UTR sequences 
by TargetScan Human v.6.2 (Fig. 3b and Extended Data Fig. 1b). Muta- 
tion of these sequences abolished the ability of miR-25 to inhibit 
reporter expression (Fig. 3b and Extended Data Fig. 1b), further sup- 
porting selective miR-25 interactions with the mRNAs, and indicating 
that the single recognition elements identified in the 3’ UTRs of the 
SERCA2a and IP3R1 mRNAs are sufficient for miR-25 activity. 
Having shown that miR-25 regulates SERCA2a and IP3R1, we eval- 
uated whether siRNAs directed against these proteins could mimic the 
effect of miR-25 on cardiac Ca** transient kinetics in vitro. Transfection 
of siSerca2a significantly slowed the decay (Ca** re-uptake) phase of the 
transient (CaTD7;_5) relative to controls in spontaneously contracting 
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Figure 1 | High-content screening identifies miRNAs that control 
SERCA2a. a, b, Target sensor construct (a) and screening workflow (b). cds, 
coding DNA sequence; Neg. ctrl, negative control; pPGK, phosphoglycerate 
kinase promoter. c, Screening summary. d, e, Primary (n = 3) (d) and 
confirmatory (” = 3) (e) screen data plotted as percentage inhibition relative to 
siRNA against SERCA 2a (100% inhibition) and scrambled sequence control 
(0% inhibition) (x-axis; AUC, area under the curve) and P value from Student’s 
t-test (y-axis). HF, heart failure. f, eGFP-SERCA2a inhibition by miR-25 and 
inactive miR-766 relative to scrambled sequence control (n = 10). g, Ca”* 
transient kinetic analysis of HL-1 cells transfected with miRNAs that inhibited 
eGFP-SERCA 2a (panel e) and are evolutionarily conserved and upregulated in 


HL-1 cells (Fig. 3c and Extended Data Fig. 1c) and neonatal rat vent- 
ricular cardiomyocytes (NRVCs; Extended Data Fig. 1d); however, the 
maximal velocity of the upstroke phase of the Ca** transient (Vimax) Was 
unchanged (Extended Data Fig. 3), accurately reproducing the effect of 
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Figure 2 | Endogenous miR-25 expression in the heart. a-c, In situ 
hybridization, revealing endogenous miR-25 expression in failing LV 
myocardium (red) compared with periostin (a, top panel versus bottom panel), 
a-smooth muscle actin («-SMA) (b) and lectin BS-1 (c) staining (each in 
green). Hoechst 33342 staining marks nuclei (blue). Arrows indicate examples 
of non-cardiomyocytes. Scale bar, 20 j1m. Data are representative of two 
biological replicates. 
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human heart failure (” > 550 cells per group). Box shows 25th to 75th 
percentiles; whiskers show 5th and 95th percentiles; dots indicate outliers. 

*P < 0.05, **P < 0.01, ***P < 0.001. NS, not significant (one-tailed ANOVA). 
h, Frequency distribution and log-normal curve fits for CaTD75_25 values from 
panel f, normalized to sample size. Both siSerca2a and miR-25 increased 
CaTD7s_25 values. i, miR-25 is upregulated in human heart failure samples, as 
demonstrated by quantitative polymerase chain reaction (qPCR). Data are 
represented as mean + standard error of the mean (s.e.m.). **P < 0.01 (n = 5), 
Student’s t-test. All replicates (n) are biological. Primary and confirmatory 
screens were performed once (d, e); subsequent analyses were performed 

two (f-h) or three (i) times. 


miR-25 and mimicking the ~1.5-2-fold decline reported for ventricu- 
lar cardiomyocytes isolated from failing human hearts’*””. siRNA against 
IP3R1 only minimally affected Ca”* transient kinetics in HL-1 cells 
and not at all in NRVCs (CaTD59, CaTD7s_25 and Vinax3 Fig. 3c and 
Extended Data Fig. 3), suggesting that the predominant effect of miR- 
25 on Ca*" transient kinetics is mediated through downregulation of 
SERCA2a. 

We evaluated the physiological effect of blocking miR-25. AntagomiRs 
are antisense oligonucleotides modified to enhance duplex stability 
and have been used effectively to inhibit miRNA function in vitro and 
in vivo'®. Anti-miR-25 transfection alone decreased CaTD75_35 in HL-1 
cells but had no effect in NRVCs (Fig. 3c and Extended Data Fig. 1d). 
Importantly, when co-transfected, anti-miR-25 cancelled the prolong- 
ing effect of miR-25 on CaTD7s_»5, restoring kinetic parameters to near 
normal in both cell types. 

We next assessed the physiological consequences of administering 
miR-25 and anti-miR-25 in vivo. AAV9-mediated gene transfer of miR-25 
increased the levels of miR-25 by approximately 50% and correspond- 
ingly decreased SERCA2a levels in ventricular myocardium (Fig. 3d-f), 
causing a progressive decline in fractional shortening, an index of car- 
diac function (Fig. 3g, h). Haemodynamic analyses at the termination 
of the studies (6 weeks after AAV transfer) confirmed that left vent- 
ricular (LV) function had declined in miR-25-injected mice (Fig. 3i-k 
and Extended Data Fig. 4b, c). By contrast, AAV-mediated transfer of 
miR-92a (Fig. 3d, e), which has the same seed sequence (AUUGCAC) 
as miR-25, did not affect SERCA2a levels (Fig. 3f) or significantly affect 
LV function (Fig. 3g-k and Extended Data Fig. 4b, c), although it down- 
regulated integrin 5, a target involved in angiogenesis'” (Extended 
Data Fig. 4a), despite the identical copy number of integrated AAV 
genomes (Extended Data Fig. 5). Together, these results point to a select- 
ive interaction between miR-25 and Serca2a mRNA that is consistent 
with the importance of non-seed sequences for target specificity”, and 
also indicate that elevated miR-25 can depress cardiac function. To 
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Figure 3 | miR-25 directly targets SERCA2a and regulates contractile Ca** 
kinetics. a, Effects of miR-25, anti-miR-25 and controls (scrambled sequence 
and Serca2a siRNA) on SERCA2a protein levels. b, Mutagenesis of the putative 
mir-25 recognition element in the SERCA2a mRNA 3’ UTR abolished 
inhibition by miR-25. *P < 0.05, **P < 0.01 (n = 10; Student’s t-test), 
compared with control. Mut., mutant; Norm., normalized; WT, wild type. 

c, Anti-miR-25 diminished the effect of miR-25 on CaTD7;_)5 in transfected 
HL-1 cells. siRNA against IP3R1 had a minimal effect. Extended Data Fig. 3 
shows additional parameters and similar results using NRVCs. Box plots as in 
Fig. 1g. *, + indicate significant differences (* or tP < 0.05, ** or *fP < 0.01, 
*** or +F +P < 0.001, one-tailed ANOVA) from negative control (*) or miR-25 
(+) (n> 550 cells per group). The data in panels a-c represent one of three 


evaluate the selectivity and efficacy of anti-miR-25, we intravenously 
injected anti-miRNAs formulated with in vivo-jetPEI reagent mixture. 
Anti-miR-25 injection decreased endogenous miR-25 levels in LV myo- 
cardium of both wild-type and SERCA2a knockout mice relative to 
treatment with a control (scrambled sequence) anti-miR (Extended Data 
Fig. 6a), but did not affect miR-92a (Extended Data Fig. 7). Accordingly, 
anti-miR-25 increased SERCA2a levels as expected in wild-type mice 
(Extended Data Fig. 6b). Notably, however, anti-miR-25 did not improve 
the cardiac morphometric or haemodynamic parameters of SERCA2a 
knockout mice (Extended Data Fig. 8), suggesting that the Ca7* pump 
is a critical target for miR-25. 

Because miR-25 decreased SERCA2a levels in vitro and in vivo, we 
evaluated whether administration of anti-miR-25 would affect cardiac 
function during chronic heart failure. Mice were subjected to 3 months 
of TAC to chronically increase LV load and cause LV dilation before 
administering anti-miR-25. Once heart failure was established, anti- 
miR-25 or a control anti-miRNA were injected intravenously and the 
mice were subsequently monitored for their effects on heart function 
(Fig. 4a). Echocardiography revealed substantial improvements in car- 
diac function after injection of anti-miR-25 at 4.5 and 5.5 months after 
TAC, despite constant pressure overload, compared with severe deteri- 
oration in animals injected with the control anti-miRNA (Fig. 4b, c 
and Supplementary Table 2). Furthermore, haemodynamic analyses at 
the termination of the studies (5.5 months after TAC) showed sub- 
stantially improved LV function in the anti-miR-25-injected mice, with 
the treatment effectively restoring the load-independent parameter 
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independent experiments. d, Protocol for AAV9-mediated cardiac gene 
transfer. e, AAV-miR-25 (n = 4) and AAV-miR-92a (n = 5) increased levels of 
their respective miRNAs relative to AAV-VLP control (n = 3). f, g, miR-25, but 
not miR-92a, diminished SERCA2a protein levels (f) and fractional shortening 
(FS; percentage) after injection (n = 5 for each cohort; g). h, LV M-mode 
images. i-k, Pronounced effect of miR-25 on pressure-volume relationship 
(i) and Pinax (j) showing decreased function relative to miR-92a and control 
AAVS9; and the effect on t, the time constant for LV relaxation, suggestive of 
diastolic dysfunction (n = 5, AAV9-VLP; n = 4, AAV9-miR-25; n = 5, AAV9- 
miR-92a). Data are represented as mean + s.e.m. in all panels except c. NS, not 
significant. All replicates (n) are biological. 


end systolic pressure volume relationship (ESPVR) and the ejection frac- 
tion (EF) to normal levels (Fig. 4d-f and Supplementary Table 3). The 
heart weight to body weight ratio was also stabilized (Fig. 4g). Survival 
of the TAC-induced heart failure animals was also improved, with 7/8 
anti-miR-25-injected and 8/8 sham-operated animals surviving versus 
7/22 of the control anti-miRNA-injected animals (Kaplan-Meier ana- 
lysis, Extended Data Fig. 9), reflected by an increased probability of 
survival of TAC animals injected with anti-miR-25 versus control anti- 
miRNA (P = 0.0131, log-rank test). Consistent with improved cardiac 
function, anti-miR-25-injected mice had significantly reduced levels of 
endogenous miR-25 compared with control animals (both TAC (Fig. 4h) 
and sham-operated (Extended Data Fig. 6a)). Neither anti-miR-25 nor 
the control anti-miRNA affected the cardiac physiology of sham-operated 
animals (Extended Data Fig. 8). Total and SUMOylated SERCA 2a levels 
were significantly increased (Fig. 4i and Extended Data Fig. 10), indi- 
cating that anti-miR-25 injection restored the loss of SERCA2a protein 
as well as post-translational modifications of SERCA2a such as SUMOy- 
lation (which improves transporter stability and ATPase activity’) 
despite the prior onset of heart failure. Furthermore, injection of anti- 
miR-25 reduced fibrosis (Fig. 4j-l, n—-p, r-t, v) and also normalized 
cardiomyocyte cell size (Fig. 4m, q, u, w). Given that anti-miR-25 had 
no salutary effect on cardiac morphometric or haemodynamic para- 
meters in SERCA2a knockout mice (Extended Data Fig. 8), did not 
affect expression of other miRNAs with homologous seed sequences 
(Extended Data Fig. 7), and that AAV9-mediated cardiac transfer of 
miR-25 selectively affected SERCA2a (Fig. 3f), we propose that the 
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Figure 4 | Inhibition of miR-25 normalizes TAC-induced cardiac 
dysfunction. a, Protocol for anti-miR-25 therapy in the mouse TAC heart 
failure model. b, LV M-mode images showing dilation with control 
anti-miRNA in contrast with anti-miR-25 injected mice. ¢, Effect on 
echocardiographic indices of LV function: EF and fractional shortening (FS), 
expressed as percentages. The number of animals initiated is n = 4 (sham 
operated), 10 (TAC plus control anti-miRNA) and 4 (TAC plus anti-miR-25); 
the numbers analysed per time point are indicated and reflect deaths in 

the TAC plus control anti-miRNA group. d-f, Haemodynamic effects. 

d, Pressure-volume plots of treatment cohorts as indicated. Note anti-miR-25 
trend towards normalization of haemodynamic indices of end ESPVR (slope 
of lines in d, e), and EF (f). n = 4 (sham operated), 4 (TAC plus control 
anti-miRNA) and 3 (TAC plus anti-miR-25). g, Heart weight (HW) to body 


beneficial effect of anti-miR-25 is due to the inhibition of pathologically 
upregulated endogenous miR-25 and the subsequent restoration of 
SERCA2a activity. To our knowledge, these data provide the first evidence 
that cardiac delivery of an anti-miRNA can directly control SERCA2a 
protein levels to achieve a long-term improvement in cardiac function. 

These studies identified miR-25 as a critical repressor of SERCA2a 
and cardiac function during heart failure. Of note, several mechanisms 


534 | NATURE | VOL 508 | 24 APRIL 2014 


oe 4°44 
—ELTAC + control anti-miRNA 10 10 8 6 
—t— TAC + anti-miR-25 4 4 4 4 


sed ! NS 
Control ek 
mike anti- Anti- 12 == 
Sham miRNA miR-25 &_ 
a3 10 
SERCA? ee SE os 
fe 
BS 06 
CAPOH er 22 04 
BS 02 
ire} gs 0 
FG 528 
5 ES fe¢ 
a ae: 
5 & 5 = 
£ 3 
NS 
Vv => 
we 40) see ae 
8 35 
6 3.0 
& 25 
g 20 
& 41.5 
2 
8 1.0 
5 05 
z 
0 
528 
5 = 
on Ee é 
& 
zg 
5 
w 8 
_ 700 
= 600 NS 
- kk okie 
§. 500, 
% & 400 
8 
& § 300 
£* 200 
3 
S100 
= 0 
to 
on € = 
BE 
g 
z 
6 
oO 


weight (BW) ratio. n = 5 (sham), 8 (TAC plus control anti-miRNA) and 

4 (TAC plus anti-miR-25).h, i, Effect of treatment on endogenous miR-25 levels 
(h; qPCR, n = 4) and SERCA2a protein (i; immunoblotting, n = 3). 

j-u, Masson’s trichrome (j, k, n, 0, r, s), haematoxylin/eosin (I, p, t) and wheat 
germ agglutinin (WGA) (m, q, u) stained sections of hearts and LV wall. 

v, Quantified fibrotic area (n = 3 (sham operated), 4 (TAC plus control 
anti-miRNA) and 5 (TAC plus anti-miR-25)). w, Average cardiomyocyte 
cross-sectional area (n = 3 for all cohorts). For all panels, mean + s.e.m. *, 

+ indicate significant differences (* or ¢P < 0.05, ** or {¢P<0.01, *** or 
+++P < 0.001, Student’s t-test between sham and anti-miRNA (*) or control 
anti-miRNA and anti-miR-25 (7)). NS, not significant. All replicates (n) are 
biological and represent one of two independent experiments. 


beyond SERCA2a deficiency, such as changes in K* channel density, 
NCX expression and myofilament sensitivity to Ca**, also contribute 
to slowing of the Ca’* transient in advanced heart failure”. miR-25 
might also control IP3R1, and thus could be involved under conditions 
of inositol-1,4,5-trisphosphate sensitization, such as in response to endo- 
thelin 1, angiotensin and phenylephrine” or in local Ca** control. 
NADPH oxidase 4 (NOX4) was recently shown to be downregulated 
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by miR-25 in a mouse model of diabetic nephropathy”; however, NOX4 
levels did not change significantly in our long-term TAC heart failure 
model (data not shown). Whether NOX4, which is responsible for the 
production of superoxide, might contribute to the salutary effects of 
anti-miR-25 therapy is unclear as it is reported to confer both protect- 
ive and detrimental effects”*. Finally, miR-25 was recently reported to 
decrease acutely after aortic constriction in mice”’, in contrast with the 
chronic elevation reported here and in human heart failure samples, 
perhaps reflecting the very different stages of pathogenesis. In conclu- 
sion, our data suggest that inhibition of miR-25 may be a novel thera- 
peutic strategy for the treatment of heart failure. 


METHODS SUMMARY 


For primary screening, HEK293 cells were co-transfected with the Ambion Pre-miR 
miRNA Precursor Human v.2.0 microRNA library (25 nM) and 300 ng SERCA2a 
3’ UTR target sensor plasmid (Fig. 1a) per well in 384-well plates, in triplicate. 

For secondary screening, Ca?* transient recordings (10 s, 33 frames per second) 
were acquired from HL-1 cardiomyocytes and NRVCs transfected with candidate 
miRNAs and loaded with Hoechst 33342 and Fluo-4 72h after transfection, and 
analysed to derive cytoplasmic calcium transient kinetic parameters using a KIC 
instrument and software (Vala Sciences). Unresponsive or low responding cells were 
removed by gating. Statistical significance was determined using one-way ANOVA. 

All mice were housed and treated in accordance with guidelines from the National 
Institutes of Health and institutional animal care and use committees, and proto- 
cols were approved by the Mount Sinai School of Medicine or Sanford-Burnham 
Medical Research Institute animal care and use committees. 

LV samples were from explanted hearts of patients with severe heart failure, 
obtained at the time of cardiac transplantation. Non-failing hearts (controls) were 
from patients who died of cerebrovascular accidents with no evidence of contractile 
dysfunction by echocardiography. The 5 non-failing hearts (3 males and 2 females) 
had a median age of 43. The 5 heart-failure patients (4 males and 1 female) had a 
median age of 54 and their mean EF before cardiac transplantation was 22 + 3%. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Foxcl is a critical regulator of haematopoietic stem/ 
progenitor cell niche formation 


Yoshiki Omatsu'’, Masanari Seike!, Tatsuki Sugiyama’, Tsutomu Kume? & Takashi Nagasawa!” 


Haematopoietic stem and progenitor cells are maintained by special 
microenvironments known as niches in bone marrow’ °. Many stud- 
ies have identified diverse candidate cells that constitute niches for 
haematopoietic stem cells in the marrow, including osteoblasts’~"°, 
endothelial cells'*“*, Schwann cells’*, ¢-smooth muscle actin-expressing 
macrophages’* and mesenchymal progenitors such as CXC chemo- 
kine ligand (CXCL)12-abundant reticular (CAR) cells’””*, stem cell 
factor-expressing cells’, nestin-expressing cells’” and platelet-derived 
growth factor receptor-a (PDGFR-a) *Sca-1*CD45~ Ter119~ (PaS) 
cells”°. However, the molecular basis of the formation of the niches 
remains unclear. Here we find that the transcription factor Foxcl is 
preferentially expressed in the adipo-osteogenic progenitor CAR 
cells essential for haematopoietic stem and progenitor cell mainten- 
ance in vivo’’*"* in the developing and adult bone marrow. When 
Foxcl was deleted in all marrow mesenchymal cells or CAR cells, 
from embryogenesis onwards, osteoblasts appeared normal, but hae- 
matopoietic stem and progenitor cells were markedly reduced and 
matrow cavities were occupied by adipocytes (yellow adipose mar- 
row) with reduced CAR cells. Inducible deletion of Foxcl in adult 
mice depleted haematopoietic stem and progenitor cells and reduced 
CXCL12 and stem cell factor expression in CAR cells but did not 
induce a change to yellow marrow. These data suggest a role for 
Foxcl in inhibiting adipogenic processes in CAR progenitors. Foxcl 
might also promote CAR cell development, upregulating CXCL12 
and stem cell factor expression. This study identifies Foxcl as a 
specific transcriptional regulator essential for development and 
maintenance of the mesenchymal niches for haematopoietic stem 
and progenitor cells. 

CAR cells express a high amount of CXCL12 and stem cell factor 
(SCF), which are essential for haematopoietic stem and progenitor 
cell (HSPC) maintenance’*’*'”7!, and are largely equivalent to SCF- 
expressing cells'* and presumably to human CD146-expressing sub- 
endothelial cells’. In vivo CAR cell ablation led to impaired HSPC 
maintenance’*. CAR cells are adipo-osteogenic progenitors, expres- 
sing adipogenic and osteogenic genes, including osterix (Osx)'*. Osx 
has been shown to be essential for osteoblast development and gen- 
eration of both bone and marrow”. However, the molecular basis of 
CAR cell development remains unclear. To address this, we compared 
messenger RNA (mRNA) expressions of a variety of transcription fac- 
tors by quantitative reverse-transcription PCR (qRT-PCR) analysis of 
CAR cells and alkaline phosphatase (ALP) osteoblasts from 14-week- 
old mice with the green fluorescent protein (GFP) reporter gene knocked 
into the Cxcl12 locus (CXCL12-GFP mice)'”. We found that the levels 
of Foxcl in CAR cells were significantly higher than in osteoblasts. 
Foxcl protein belongs to a family of transcription factors characterized 
by the presence of a forkhead box (Fox) DNA-binding domain. qRT- 
PCR analysis of 14- to 20-week-old CXCL12-GFP mice showed that 
CAR cells expressed substantially higher levels of Foxcl than other bone 
marrow non-haematopoietic populations, including osteoblasts, Pas 
cells (PDGER-a«” Sca-1* CD31 CD45" Ter119~ )?°”4 and endothelial 


cells, as well as PaS adipo-osteogenic progenitors in muscle tissue 
(Fig. 1a). PaS cells are enriched for mesenchymal progenitors but dis- 
tinct from CAR cells, which express PDGFR-« but not Sca-1 (refs 18 
and 20). Haematopoietic cells, including HSPCs and macrophages, did 
not express Foxcl mRNA (Fig. 1a). Single-cell RT-PCR analysis showed 
that all individual CAR cells expressed Foxcl in 14-week-old CKCL12- 
GFP mice (50 of 50; 100%). During development, Osx-GFP* PDGFR- phi 
putative CAR progenitors from embryonic day (E)16.5 transgenic mice 
expressing GFP reporter gene under the control of Osx regulatory 
elements (Osx-GEP mice), and CXCL12-GEP* cells from newborn 
CXCL12-GFP mice (Extended Data Fig. 1), expressed substantially 
higher levels of Foxcl than other mesenchymal populations (Fig. 1b) 
and the Foxcl expression in CXCL12-GFP * cells increased over time 
(Fig. 1c). CXCL12-GEP* cells from newborn CXCL12-GEP mice ex- 
pressed PDGFR-§, leptin receptor (Lepr), CXCL12, SCF and peroxisome 
proliferator-activated receptor-y (PPARy), which were preferentially 
expressed in CAR cells'*”’, but distinctly lower than adult CAR cells 
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Figure 1 | Foxcl is preferentially expressed in CAR cells. ac, Foxcl mRNA 
levels in CAR cells, osteoblasts (Ob), PaS cells, bone marrow endothelial cells 
(ECs), KSL cells, macrophages and muscle PaS cells from adult mice (a), and 
in Osx-GFP* PDGER-B", Osx-GEP* PDGER-f'°, CXCL12-GFP* cells, 
osteoblasts, PaS, Osx-GFP PDGFR- B* Sca- 1 CD31 CD45 Terll9” 

(Osx” RB*) cells and ECs from femurs of E16.5 Osx-GEP mice (b) or newborn 
(b, c), 1-week-old, 3-week-old and 14- to 20-week-old CXCL12—GFP mice (c) 
(n = 3). d, Foxcl protein was detected in CAR cells (left, arrows) but not in Pas 
cells (right, arrowheads) or ECs of arteries (right, white arrows) and sinuses 
(right, yellow arrows) in the marrow cavity of 2-week-old CXCL12-GFP (left) 
or wild-type (right) mice. Error bars, s.d. *P < 0.05. 
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and thus were termed primordial CAR cells (Extended Data Fig. Ic, e, f). 
Immunohistochemical analysis of bone marrow cavities with antibodies 
against Foxcl showed that Foxcl protein was detected in CAR cells 
but not in PaS cells or endothelial cells of arteries and sinuses (Fig. 1d 
and Extended Data Fig. 2). These results indicate that Foxcl is 
expressed preferentially in CAR cells in the developing and adult bone 
marrow. Foxcl and Foxc2 are closely related Fox family members. 
Although Foxc2 mRNA levels were higher in CAR cells than other 
marrow mesenchymal populations, the differences in Foxc2 mRNA 
levels between CAR cells and osteoblasts were much smaller than 
those in Foxcl mRNA levels (Fig. la~c and Extended Data Fig. 3). 
Foxcl null mutants die perinatally with haemorrhagic hydroceph- 
alus and calvarial defects”. To determine the roles of Foxcl in bone 
marrow mesenchymal niche formation, we used mice with conditional 
floxed Foxcl alleles (Foxc#” mice)** in conjunction with transgenic 
mice expressing Cre recombinase under the control of Prx1 regulatory 
elements (Prxl1-Cre mice)”’. Prxl-Cre mice express Cre recombinase 
in all mesenchymal cells in developing limbs and parts of the skull but 
not in other organs”’. Prx1-Cre targeted all CAR cells, PaS cells and 
osteoblasts but not haematopoietic and endothelial cells in bone marrow” 
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(data not shown). Prx1-Cre;Foxcl/? mice were born at the expected 
Mendelian ratio and remained viable much longer than Foxcl null 
mutants, although they showed hydrocephalus and did not survive 
beyond 6 weeks of age. Flow cytometric analysis showed that total 
haematopoietic cell counts and the numbers of the CD34~ CD150* 
CD48 subset of c-kit” Sca-1* Lineage (Lin) (KSL) cells, which are 
highly enriched for long-term repopulating haematopoietic stem cells 
(LT-HSCs)", CD34*CD48~ KSL short-term repopulating HSCs (ST- 
HSCs), Flt3* KSL multipotent progenitors (MPPs), Lin IL- 7Rot* Ft3* 
common lymphoid progenitors (CLPs), c-kit*CD19* IgM pro-B cells, 
c-kit CD19*IgM_ pre-B cells, c-kit" CD71 *Ter119"° proerythroblasts 
and c-kit’Sca-1 Lin CD34‘FcyRII/III™ granulocyte/macrophage 
progenitors were severely reduced in the bone marrow of 3-week-old 
Prxl-Cre;Foxc lf mice compared with control animals (Fig. 2a). Addi- 
tionally, the numbers of functional HSCs, including repopulating units, 
based on a competitive repopulation assay, and long-term in vitro 
culture-initiating cells (LTC-IC)”, based on a limiting dilution LTC- 
IC assay, were markedly reduced in the mutant bone marrow (Fig. 2b 
and Extended Data Fig. 4). Flow emer analysis of the bone marrow 
from newborn Prx1-Cre;Foxc ’ + mice showed that the numbers of 
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blood (c) of 3-week-old control and 
Prx1-Cre;FoxcV mice (n = 5). The 
numbers of functional HSCs were 
estimated using repopulating units 
(RUs) (b, n = 3). d-i, k, Histological 
analysis of bone marrow from 
3-week-old (d-i) and 1-week-old (k) 
control and Prx1-Cre;Foxcl 
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CXCL12-GFP mice (e, k). 
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osteoblasts (g, red) and Sca- 
1*CD317 PoS cells (h, arrowheads). 
i, The cortical thickness of femoral 
diaphysis (n = 5). j, Perilipin, Fabp4, 
CXCL12 and SCF mRNA levels in 
residual CAR cells from 3-week-old 
Prx1-Cre;Foxcl and control mice 
(n = 5). k, CAR cells (green) with 
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CD150*CD48~ KSL cells (HSCs), pro-B cells, pre-B cells and pro- 
erythroblasts were reduced although to a lesser extent than juvenile 
mutants (Extended Data Fig. 5). In peripheral blood of Prx1-Cre; 
Foxcl! mice, B cells were decreased but primitive haematopoietic cells 
were not increased (Fig. 2c and data not shown). 

Surprisingly, histological analysis of 3-week-old Prx1-Cre;Foxcl’’ 
mice showed that most marrow volume was occupied by adipocytes 
with few haematopoietic cells, and haematopoietically active marrow 
was restricted to the subendosteal region near the bone surface (Fig. 2d). 
Immunohistochemical analysis of CKCL12-GFP mice with antibodies 
against cytoplasmic protein S100 showed that CAR cells could be iden- 
tified as $100 * cells in the marrow cavity (Extended Data Fig. 6). Immuno- 
histochemical analysis showed that the numbers of CKCL12-GFP* 
or S100* CAR cells were severely reduced (Fig. 2e, f) but Ocn* osteo- 
blasts lining the endosteal bone surface and Pas cells (Fig. 2g, h and 
Extended Data Fig. 7a) appeared normal in the bone marrow of Prx1- 
Cre-targeted mice compared with control animals. Consistent with this, 
Prx1-Cre;Foxc¥! mice had normal or slightly reduced cortical thick- 
ness of the femoral diaphysis (Fig. 2i). (RT-PCR analysis showed that 
residual CAR cells in Prxl-Cre-targeted mice had higher expression 
levels of adipocyte markers, including perilipin and fatty acid-binding 
protein 4 (Fabp4) and lower levels of CKCL12 and SCF expression than 
control animals (Fig. 2j). Histological analysis of bone marrow showed 
that primordial CAR cells appeared normal and few adipocytes were 
observed in newborn Prx1 -Cre;Foxc lf CXCL12-GFP mice (data not 


shown); however, CAR cells that differentiated into multivacuolar lipid- 
containing preadipocytes were detected in the mutants at 1 week of 
age (Fig. 2k). Thus, in the absence of Foxcl during postnatal CAR cell 
development, most primordial CAR cells might differentiate to become 
adipocytes, which could not function as a HSPC niche’®, and thus the 
numbers of CAR cells, which function as niches for HSC homing and 
HSPC maintenance, would be severely reduced, resulting in a reduc- 
tion in HSPCs. 

To analyse the effects of CAR cell-specific disruption of Foxcl, we 
crossed Foxc¥’ mice with Lepr-Cre knock-in mice. Lepr-Cre targeted 
approximately 72% and 92% of CAR cells but not other bone marrow 
cells in the 3-week-old and 14-week-old mutants, respectively** (data 
not shown). Lepr-Cre;Foxcl mice were viable and did not show hydro- 
cephalus. Flow cytometric analysis showed that total haematopoietic 
cell counts and the numbers of LT-HSCs, ST-HSCs, CLPs, pro-B cells, 
pre-B cells and proerythroblasts were severely reduced in the bone 
marrow of 14- to 18-week-old Lepr-Cre;Foxcl mice compared with 
control animals (Fig. 3a). Additionally, the numbers of functional LT- 
HSCs, including repopulating units and LT'C-ICs, were markedly reduced 
in the mutant marrow (Fig. 3b and Extended Data Fig. 8). In contrast, 
LT-HSCs and proerythroblasts were increased in the spleen but not 
in the peripheral blood of the mutants (Fig. 3c and data not shown), 
suggesting that reduction in HSPCs in the marrow might result in extra- 
medullary haematopoiesis. Histological analysis of 18-week-old Lepr- 
Cre;Foxcl! mice showed that most marrow volume was occupied by 
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d-f, Adipocytes (d, red), Ocn* 
osteoblasts (e, red) and Sca- 
1*CD31~ Pas cells (f, arrowheads) 
in the marrow from control and 
Lepr-Cre;Foxc! mice. cKO, Lepr- 
Cre;Foxcl! mice. g-k, Analysis of 
primordial (g, h) and adult (h) CAR 
cells or OP9 cells (i-k) transduced 
with control or Foxcl-expression 
construct. Foxcl, CXCL12 and SCF 
mRNA levels (g, i). Percentages of 
Nile-Red-positive cells (h) or 
Nile-Red-positive area (k, right) 
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adipocytes (Fig. 3d) but endosteal osteoblasts, PaS cells and cortical 
thickness of the femoral diaphysis appeared normal (Fig. 3e, f, Extended 
Data Fig. 7b and data not shown). These results support the idea that 
Foxcl of CAR cells has an essential role in inhibition of their differ- 
entiation to adipocytes and HSPC niche formation. In contrast, HSPC 
maintenance and haematopoiesis were unimpaired in 14- to 18-week- 
old Foxcl! mice in conjunction with Tie2-Cre transgenic mice, in 
which the Foxc1 gene was deleted in endothelial and haematopoietic 
cells (Extended Data Fig. 9), indicating that Foxcl in endothelial and 
haematopoietic cells is not required for haematopoiesis. 

To confirm the in vivo function of Foxcl in CAR cell development, 
we examined the effects of Foxcl expression in CAR cells and an adi- 
pocyte progenitor cell line, OP9, which was derived from newborn 
op/op mouse calvaria”’. Enforced expression of Foxcl by retroviral infec- 
tion in sorted primordial and adult CAR cells increased CXCL12 and 
SCF mRNA expression as assessed by qRT-PCR (Fig. 3g and data not 
shown) and markedly decreased adipogenic potential in the culture 
(Fig. 3h). OP9 cells expressed Foxcl at very low levels and had the poten- 
tial to differentiate into adipocytes in an in vitro culture (Fig. 3i, k). 
Enforced expression of Foxcl in OP9 cells increased CKCL12 and SCF 
mRNA expression and ability to support B lymphopoiesis, and led to a 
loss of adipogenic potential in the culture (Fig. 3i-k). 

To determine the role of Foxcl in the maintenance of HSPC niches, 
we induced deletion of the Foxc! gene in adults by using Foxcl mice 
in conjunction with Ubc-CreERT2 transgenic mice, allowing recombina- 
tion of the floxed Foxc] allele upon tamoxifen treatment (Ubc-CreERT2; 
FoxcF mice). Total haematopoietic cell counts and the numbers of LT- 
HSCs, ST-HSCs, MPPs, CLPs, pro-B cells, pre-B cells and proerythro- 
blasts were severely reduced in the bone marrow of 16- to 22-week-old 
tamoxifen-treated Ubc-CreERT2;Foxc Ll! mice compared with control 
animals (Fig. 4a and data not shown). In contrast, LT-HSCs were increased 
in the spleen of the mutants (data not shown), suggesting that Foxcl 
plays a bone-marrow-specific role in maintaining HSCs in adults. Histo- 
logical analysis of bone marrow showed that only small numbers of 
adipocytes were observed (Fig. 4b) and the numbers of $100* PDGER- 
B* cells were reduced (Fig. 4c) but processes of PDGFR-B* cells forming 


LETTER 


a network appeared normal in tamoxifen-treated Ubc-CreERT2;Foxcl’! 
mice compared with control animals (Fig. 4c). These results indicate 
that many CAR cells, whose niche functions were impaired, were pre- 
sent in the mutants. Immunohistochemical analysis showed that endothe- 
lial cells, osteoblasts, PoS cells (Fig. 4d, e and data not shown) and 
cortical thickness of the femoral diaphysis (Fig. 4f) appeared normal in 
the mutant bone marrow. qRT-PCR analysis showed that residual 
CAR cells had lower levels of CXKCL12 and SCF expression, but slightly 
higher levels of perilipin and Fabp4 expression in tamoxifen-treated 
Ubc-CreERT2;FoxcP mice compared with control animals (Fig. 4g 
and data not shown). These results indicate that Foxcl is essential for 
maintenance of CXCL12 and SCF expression in CAR cells and HSPC 
niche functions of adult marrow microenvironments. Although enforced 
expression of Foxcl in adult CAR cells inhibited adipogenesis in vitro, 
bone marrow adipogenesis remained blocked after induction of Foxcl 
deletion in adults. Further studies will be needed to delineate a cell- 
intrinsic or non-cell-intrinsic mechanism by which adipogenesis is blocked 
in adult marrow. 

We have shown that Foxcl is essential for CAR cell development. 
During postnatal development, primordial CAR cells begin to express 
adipogenic genes (Extended Data Fig. 1f) and upregulate the expression 
of Foxcl, which might promote CAR cell development upregulating 
CXCL12 and SCF expression and subsequently inhibit the differenti- 
ation of developing CAR cells into adipocytes. Our findings, together 
with the knowledge that Foxn1 is essential for the development of thy- 
mic epithelium, which supports T-cell production”, establish that Fox 
proteins play critical but distinct roles in the formation of different hae- 
matopoietic microenvironments. Clinically, it may be possible that Foxcl 
or its activators can be used therapeutically to drive the formation of hae- 
matopoietic niches from non-niche cells, such as primary dermal fibro- 
blasts or induced pluripotent stem (iPS)-derived mesenchymal cells 
in vitro, or from impaired marrow microenvironments in vivo. This 
study provides a valuable starting point in understanding the molecular 
mechanisms of HSPC niche formation and new insights into features 
of haematopoietic microenvironments and mesenchymal progenitors. 
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METHODS SUMMARY 


Foxc¥ mice** and CXCL12-GEP knock-in mice’ have been described previously. 
Details of procedures and reagents are described in Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Akt, also known as protein kinase B, plays key roles in cell prolif- 
eration, survival and metabolism. Akt hyperactivation contributes 
to many pathophysiological conditions, including human cancers’, 
and is closely associated with poor prognosis and chemo- or radio- 
therapeutic resistance*. Phosphorylation of Akt at $473 (ref. 5) and 
T308 (ref. 6) activates Akt. However, it remains unclear whether 
further mechanisms account for full Akt activation, and whether 
Akt hyperactivation is linked to misregulated cell cycle progression, 
another cancer hallmark’. Here we report that Akt activity fluctuates 
across the cell cycle, mirroring cyclin A expression. Mechanistically, 
phosphorylation of $477 and T479 at the Akt extreme carboxy ter- 
minus by cyclin-dependent kinase 2 (Cdk2)/cyclin A or mTORC2, 
under distinct physiological conditions, promotes Akt activation 
through facilitating, or functionally compensating for, $473 phos- 
phorylation. Furthermore, deletion of the cyclin A2 allele in the mouse 
olfactory bulb leads to reduced $477/T479 phosphorylation and ele- 
vated cellular apoptosis. Notably, cyclin A2-deletion-induced cel- 
lular apoptosis in mouse embryonic stem cells is partly rescued by 
S477D/T479E-Akt1, supporting a physiological role for cyclin A2 in 
governing Akt activation. Together, the results of our study show 
Akt S$477/T479 phosphorylation to be an essential layer of the Akt 
activation mechanism to regulate its physiological functions, there- 
by providing a new mechanistic link between aberrant cell cycle pro- 
gression and Akt hyperactivation in cancer. 

Using single live-cell imaging*, we found that Akt activation fluc- 
tuated across the cell cycle, inversely correlating with Cdt1 abundance® 
(Fig. la and Supplementary Fig. 1a). Statistical analysis of immunos- 
tained HeLa cells further showed that Akt-pS473 has a similar periodic 
feature as geminin’’° (Fig. 1b). Notably, in several cancer cell lines 
(Fig. 1c, d and Supplementary Fig. 1b, c), Akt phosphorylation, but not 
total Akt abundance, fluctuated across the cell cycle. The periodic Akt 
phosphorylation mirrored the expression pattern of cyclin A2, the pre- 
dominant mammalian cyclin A isoform”’, during cell cycle progression 
(Fig. 1c, d). Moreover, acute depletion of cyclin A2 or Cdk2, but not 
cyclin E, resulted in decreased Akt phosphorylation, with no significant 
impact on phosphorylation of Akt upstream kinases PDK1 and mTORC2 
(Fig. 1d). This prompted us to evaluate whether Cdk2/cyclin A directly 
regulates Akt activation in a phosphorylation-dependent manner during 
the cell cycle’. 

In support of Akt as a Cdk2/cyclin A substrate, Akt isoforms inter- 
acted with cyclin A2 (Fig. 2a and Supplementary Fig. 2a). Furthermore, 
we identified four ‘RXL’ cyclin A-binding motifs'* in Akt] (Fig. 2b), all of 
which are evolutionarily conserved (Supplementary Fig. 2b). Mutation 
of R76CL or R273DL, and to a lesser extent, R200VL or R370TL (RXL to 
AXA) attenuated Akt] interaction with cyclin A2 (Fig. 2c), and reduced 


Akt] activity (Supplementary Fig. 2c). Consistently, depleting cyclin A2 
or Cdk2 (Supplementary Fig. 2d-f) led to a significant reduction in Akt 
phosphorylation. More importantly, either acute treatment with Cdk2 
inhibitors (Fig. 2d) or deletion of the cyclin A2 allele in cyclin A2" pri- 
mary mouse embryonic fibroblasts (MEFs) (Fig. 2e) led to a marked 
decrease in Akt phosphorylation without a significant perturbation of 
cell cycle progression (Fig. 2d and ref. 14), excluding a possible indirect 
cell cycle effect on Akt phosphorylation by inhibiting Cdk2/cyclin A. 

Notably, deletion of cyclin A2, but not cyclin A] or cyclin E1/E2 alleles, 
caused a significant decrease of Akt phosphorylation (Fig. 2f, g), whereas 
conversely, ectopic expression of cyclin A2 (Fig. 2h and Supplementary 
Fig. 3a) resulted in elevated Akt phosphorylation coupled with enhanced 
in vitro anchorage-independent growth (Supplementary Fig. 3b, c). More- 
over, depletion of Cdh1, the E3 ligase that controls cyclin A turnover’’, 
resulted in increased cyclin A2 abundance and elevated Akt phosphoryla- 
tion, leading to enhanced in vitro anchorage-independent growth (Supplemen- 
tary Fig. 3d, e) and in vivo tumour formation (Fig. 2i and Supplementary 
Fig. 3f-h). More importantly, increased Akt phosphorylation and tumor- 
igenicity by depleting Cdh1 could be partly reversed by extra depletion of 
cyclin A2 (Fig. 2) and Supplementary Fig. 3i-1). Collectively, these results 
support Cdk2/cyclin A2 as a major physiological kinase that governs Akt 
phosphorylation and oncogenic functions. 

Notably, Cdk2/cyclin A directly phosphorylated Akt1 in vitro on its 
carboxy (C)-terminal region (Supplementary Fig. 4a, b). Serial trunca- 
tions showed Cdk2/cyclin A phosphorylation sites in the last four 
evolutionarily conserved residues and subsequent mutageneses pin- 
pointed both $477 and T479 as Cdk2/cyclin A sites (Supplementary 
Fig. 4c-e). Similarly, Cdk2/cyclin A phosphorylated Akt2-S478 (Sup- 
plementary Fig. 4f). Interestingly, mutation of G478 to proline (G478P) 
to mimic the canonical Cdk2 ‘SP/TP’ phospho-motif"®"’, or to other 
bulky amino acids (L/W/R), did not significantly affect Cdk2/cyclin-A- 
mediated Akt phosphorylation (Supplementary Fig. 4g). Conversely, 
C-terminal addition of an «-helix'* (Supplementary Fig. 4g) or a green 
fluorescent protein (GFP) (Supplementary Fig. 4h) reduced Cdk2/ 
cyclin-A-mediated phosphorylation of the engineered non-tail version 
of S477G478-, but not $477P478-Akt1. These data indicate that $477/ 
T479 may belong to a new class of Cdk2/cyclin A phospho-motifs where 
relative structural flexibility at the Akt] C terminus might override the 
requirement ofan adjacent proline for Cdk2-mediated phosphorylation 
of canonical TP/SP sites’*”” typically buried within defined structures. 

Furthermore, mass spectrometry analyses confirmed Akt] $477 and 
T479 phosphorylation’? (Supplementary Fig. 5a-d). To gain further 
mechanistic insights, we developed phospho-specific antibodies that 
recognize pS477/pT479-Akt1 (Supplementary Fig. 6a-f), pS477-Akt1 
or pT479-Akt1 (Supplementary Fig. 7a—h) in vivo. Owing to the large 
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Figure 1 | Akt activity fluctuated during the cell cycle and mirrored 

the periodic cyclin A expression pattern. a, Single live-cell imaging of the Akt 
activity reporter (green, EGFP-Akt AR (activity reporter)) and the cell 

cycle marker Cdt1 (red, mCherry—Cdt1) transiently expressed in non- 
synchronized HeLa cells. b, A representative heat map of cell-cycle-dependent 
Akt1-pS473 as a function of DNA content (x axis) and geminin expression 
as an indicator of cell cycle stages (y axis). Blue, low Akt-pS473; red, high 
Akt-pS473. Number of cells was more than 500. c, Akt phosphorylation 


consistency among three antibodies under our experimental condi- 
tions, we focused on examining the correlation of Akt1-pS477/pT479 
and Akt activation in the remainder of the studies. As with pS473- 
Akt1, Akt1 tail phosphorylation fluctuated during the cell cycle (Fig. 1c, 
d and Supplementary Fig. 1b, c), and was subjected to regulation by 
Cdk2/cyclin A (Fig. 2d-h). Furthermore, a positive correlation between 
cyclin A2 expression and Akt1-pS477/pT479 was observed in breast 
cancer patient samples (Fig. 2k and Supplementary Fig. 8a, b). More 
importantly, similar to the reported phosphorylation of Akt-S473 by 
several upstream kinases dependent on upstream stimuli°”®*', Akt1- 
pS477/pT479 could be mediated by mTOR (Supplementary Figs 9a-c, 
10a-g and 1la-c) or DNAPK (Supplementary Fig. 9d, e), in addition to 
Cdk2/cyclin A. In keeping with this notion, elevated Akt1-pS477/pT479 
was detected in Pten heterozygous MEFs (Supplementary Fig. 10h). Inter- 
estingly, Akt1-pS477/pT479 was negatively regulated by PTEN in mice ina 
tissue-specific manner and largely with Akt-pS473 (Supplementary Fig. 10i). 

Moreover, inactivation of mTORC2 by depleting Rictor led to a 
more dramatic reduction of Akt1-pS477/pT479 in response to insulin 
(Supplementary Fig. 11a) than under synchronized cell cycle condi- 
tions (Supplementary Fig. 12a). Conversely, depletion of Cdk2, but not 
Rictor (Supplementary Fig. 12a, b), resulted in a more robust reduction 
in Aktl-pS477/pT479 across the cell cycle. These findings suggest that 
Akt1-pS477/pT479 is possibly mediated by Cdk2/cyclin A, mTORC2 
or DNAPK, under cell cycle progression, growth factor stimulation or 
DNA damaging conditions, respectively (Supplementary Fig. 13a, b). No- 
tably, depletion of Cdk2, cyclin A2 or Rictor in human primary foreskin 
fibroblasts all resulted in reduced Aktl-pS477/pT479 (Supplementary 
Fig. 14a-c), highlighting both Cdk2/cyclin A and mTORC2 as upstream 
physiological kinases governing Aktl-pS477/pT479. We therefore next 
evaluated the contribution of Akt1-pS477/pT479 to Akt kinase activation 
under various cellular conditions. 
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fluctuated across the cell cycle. Immunoblot (IB) of whole-cell lysates (WCLs) 
derived from T98G cells synchronized by double thymidine block and released 
to normal cell cycle for the indicated periods. d, Depletion of cyclin A, but 
not cyclin E, led to reduced Akt phosphorylation across the cell cycle. 
Immunoblot of whole-cell lysates derived from HeLa cells synchronized by 
nocodazole and released for the indicated periods. Where indicated, short 
interfering RNA (siRNA) oligonucleotides were transfected into cells 24h 
before synchronization. 


In keeping with the notion that Akt1 tail phosphorylation is required 
to achieve full Akt1 activation, the phospho-deficient Akt1-S477A/T479A 
(Akt1-AA) mutant showed a dramatic suppression, whereas a phospho- 
mimetic S477D/T479E (Akt1-DE) mutant showed enhanced Akt1 phos- 
phorylation and activation (Fig. 3a—-c and Supplementary Fig. 15a-c). 
However, caution needs to be taken in interpreting these results, as these 
mutations might not fully recapitulate the in vivo phosphorylation status. 
Strikingly, Aktl-AA was defective in pS473 even in the myristoylation- 
tagged constitutively active Aktl (Fig. 3d), advocating a critical role for 
Akt1-pS477/pT479 in activating Akt (Supplementary Fig. 16a-c). Con- 
sistently, S477D/T479E effectively rescued Akt-pS473 and kinase activity 
of the cyclin A binding motif-defective mutant, R76A (Fig. 3e, f), as well as 
its anchorage-independent growth ability (Supplementary Fig. 16d). How- 
ever, the rescue effect of DE on Akt1-R76A was not due to changes in its 
affinity with PIP3 (Supplementary Fig. 16e). 

Consistent with $477/T479 being Cdk2/cyclin A sites, ectopic expres- 
sion of Akt1-DE partly rescued cell cycle defects observed in quadruple 
knockout MEFs (cyclin El /~/cyclin E2/“/cyclin Al‘ /cyclin A2") 
after Cre infection (Fig. 3g and Supplementary Fig. 16f). Compared with 
the periodic phosphorylation of wild-type Aktl (Akt1-WT), Aktl-AA 
was severely impaired, whereas Akt1-DE showed an elevated and con- 
stitutive $473/T308 phosphorylation across the cell cycle (Fig. 3h). More 
importantly, under several physiological conditions (Supplementary 
Fig. 17a-c), Akt-pS473/pT308 was severely compromised in Akt1-AA, 
but robustly elevated in Akt1-DE expressing cells. Consistently, deple- 
tion of cyclin A2 in Akt1-WT, but not Akt1-DE expressing cells, led to a 
significant reduction in Akt phosphorylation (Supplementary Fig. 17d) 
and decreased tumour formation in vivo (Fig. 3i and Supplementary 
Fig. 17e-g). Cumulatively, these data demonstrate that in response to 
several upstream signals, phosphorylation of Akt1-S477/T479 may gov- 
ern the canonical Akt-pS473 to promote Akt activation. Thus, we next 
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Figure 2 | Cdk2/cyclin A2 functioned as a physiological kinase 


Levels of Akt1-pS477/pT479 


infection. f, Immunoblot of WCLs derived from indicated immortalized MEFs 


phosphorylating Akt1 at both $477 and T479. a, Akt] interacted with cyclin _ with or without Cre viral infection. g, Immunoblot of WCLs derived from 


A2 at endogenous levels. Immunoblot analysis of WCLs and anti-Akt1 


2iit 


cyclin A2"" mouse embryonic stem (ES) cells stably expressing Cre-ER 


immunoprecipitations (IP) derived from MCF7 or T47D cells. b, Illustration of treated with 2 pg ml’ tamoxifen (4-OHT) for the indicated days. 


four putative cyclin A binding motifs (RXL) in human Aktl1. c, Deficiency 
in cyclin A binding led to attenuated Akt phosphorylation. Immunoblot 

of WCLs and human influenza hemagglutinin (HA)-immunoprecipitations 
derived from HeLa cells transfected with indicated HA-Akt1 constructs. 


h, Immunoblot of WCLs derived from HeLa cells stably expressing inducible 
pTRIPZ-cyclin-A2 treated with 500 ng ml! doxycycline for the indicated 
periods. i, j, MDA-MB-231 cells depleted of Cdh1 by two independent shRNAs 
(i), or depleted of both Cdh1 and cyclin A2 (j), were injected into nude 


d, Immunoblot of WCLs derived from HeLa cells treated with indicated Cdk2 _ mice (m = 10 for each group) and monitored for tumorigenesis in vivo. 


inhibitors at various time points. Drug doses used were roscovitine (40 11M), 
CVT-131 (0.5 uM) and mimosine (50 uM). e, Immunoblot of WCLs 
derived from cyclin A2‘" primary MEFs with or without adenoviral-Cre 


*P < 0.05 (Student’s t-test). k, Relative percentages of 50 samples examined 
from patients with breast cancer bearing the indicated cyclin A2 and 
Akt1-pS477/pT479 status indexed by either low or high. 


evaluated the precise molecular mechanism(s) linking Akt1-pS477/pT479 | Mechanistically, enhanced phosphorylation of Aktl-DE at $473 by 


to pS473 and subsequent Akt activation. 


As Akt2 crystal structures are reported in great detail’** 


mTOR may be in part due to its increased interaction with Sin1 (Fig. 4c) 


, we next andmTOR, but not due to changes in its association with the Akt inhibitor 


focused on understanding how Akt2-pS478 (equivalent to Akt1-pS477; | CTMP” or phosphatase PHLPP2 (ref. 26) (Supplementary Fig. 19c, d). 
Supplementary Figs 4d and 18a) modulates Akt2 kinase activity. Notably, | However, phospho-mimetic mutation of $473 (S473D) (Supplementary 
Akt2-pS478 functioned synergistically with Akt2-pS474 (equivalent to Fig. 20a) or its mimicking peptide library (Supplementary Fig. 20b) led to 
Akt1-pS473) to allosterically activate Akt2 (Fig. 4a and Supplementary reduced Akt] tail phosphorylation, suggesting that the p$477/pT479 event 
Fig. 18b). Mechanistically, Akt2-pS478 may create a new charge-charge may precede pS473. 
interaction between S478D and R208 to stabilize Akt2 in its closed, active Mutation of either S473 or S477/T479 to alanine significantly redu- 
form (Supplementary Fig. 18c, d). Consistently, deletion of the Akt1-tail ced Akt activity towards phosphorylating Skp2 or crosstides, whereas 
after amino acid 475 (termed 476A) led to significantly increased Akt phos- _no significant additive effect was observed in $473A/S477A/T479A 
phorylation (Supplementary Fig. 18e), suggesting that either phosphoryla- | (Supplementary Fig. 21a, b), arguing for a possible functional redund- 
tion or deletion of the tail region could lock Akt in its active conformation. ancy between pS477/pT479 and pS473 for Akt catalytic activity. Consis- 
Intriguingly, in addition to stabilizing Akt, Akt tail phosphorylation _ tently, Akt1-DE kinase activity was largely unchanged after depletion 
may alter Akt kinase kinetics to accelerate the kinase reaction (Supplemen- _ of Rictor (Supplementary Fig. 21c, d), whereas Akt1-DE partly res- 
tary Fig. 18f). Furthermore, pS473 and pS477/pT479, both of which activ- cued the deficient Akt kinase activity in Akt1-S473A (Supplementary 
ate Akt, are intrinsically linked; as compared with Aktl-WT, Aktl-DEbut — Fig. 21e, f). Given the close proximity of $473 and S477/T479, these 
not Akt1-AA was preferentially phosphorylated by mTOR in vitro (Sup- _ results indicate that in addition to promoting pS473, pS477/pT479 may 
plementary Fig. 19a, b). This indicates that p$477/pT479 might prime Akt1 also trigger Akt activation independently of, or partly compensate for, 
for mTORC2-mediated phosphorylation of $473 (refs 5, 24) (Fig. 4b). pS473. 
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Figure 3 | Akt1-S477/T479 phosphorylation triggered Akt1-S473 
phosphorylation and enhanced Akt1 activation. a, Aktl-S477D/T479E 
mutation led to elevated Akt-pS473. Immunoblot of WCLs and HA- 
immunoprecipitations derived from HeLa cells transfected with indicated Akt1 
constructs. EV, empty vector. b, In vitro kinase assays of indicated affinity- 
purified Akt1 kinases with crosstide as a substrate. Experiments were 
performed in triplicate; data are shown as mean = s.d. c, Aktl-S477D/T479E 
mutation resulted in enhanced Skp2 phosphorylation. Immunoblot of 

WCLs and Flag-immunoprecipitations derived from HeLa cells transfected 
with indicated Akt1 constructs together with Flag—Skp2. d, e, Immunoblot of 
WCLs and HA-immunoprecipitations derived from HeLa cells transfected 
with indicated Akt1 constructs. f, In vitro kinase assays of indicated affinity- 
purified Akt1 kinases with crosstide as a substrate. Experiments were 
performed in triplicate; data are shown as mean = s.d. g, Relative 
5-bromodeoxyuridine (BrdU) incorporation for quadruple knockout MEFs 
(cyclin E1-/~/ E2-/~/A1 ‘/A2") expressing indicated Akt1 constructs that 
were further infected with or without Cre to delete the cyclin A2 alleles. 

h, Akt1-S477D/T479E mutation led to sustained Akt phosphorylation across 
the cell cycle. Immunoblot of WCLs derived from HeLa cells transfected with 
indicated Akt1 constructs, synchronized by nocodazole and released for 
indicated periods. i, Akt1-depleted HeLa cells stably expressing Aktl-WT or 
-DE were subjected to cyclin A2 knockdown by lentiviral shRNA infections. 
The resulting cell lines were subcutaneously injected into nude mice (n = 10 for 
each group) and monitored for tumorigenesis. P values were calculated by 
Student’s t-test. 


Biologically, phosphorylation of the Akt1 tail activated Akt1, lead- 
ing to elevated Skp2 or FOXO phosphorylation, which further pro- 
moted cell cycle progression (Supplementary Fig. 22a—c), or conferred 
resistance to the chemotherapeutic agents etoposide or camptothecin 
(Supplementary Fig. 22d, e), respectively. More importantly, compared 
with Aktl-WT, Akt1-DE-expressing cells showed growth advantage in 
both in vitro soft agar (Fig. 4d, e) and in vivo tumour formation (Fig. 4f, 
g and Supplementary Fig. 23a, b) assays, supporting a role for Akt1- 
pS477/pT479 in promoting Akt] activation and signalling phenotypes 
associated with malignancy. Notably, Aktl-476A phenocopied Akt1- 
DE by showing elevated Akt phosphorylation (Supplementary Fig. 18e), 


544 | NATURE | VOL 508 | 24 APRIL 2014 


a b c 
24,000 igh > 
2 4 
= 18,000 Homo Aktl F SYSASGTA a = WO HA-Aktt 
7 Peptide mix + + + Flag-Sint 
& 12,000 @ =o 8 ~ ==) IB: HA 
2 65 to = 
5 6,000 Zu 0 2 |] me ee | IB: Flag 
3 
8 eee 
0 <x - «| |B: 
None No phos pS474 pS478 pS474/pS478 = IB: Flag 
——__ 2 a . 
Akt2 C-tail peptide = ee | ae) ena 
Akt2 tail No phos: HFPQFSYSASIRE gs 
Akt2 tail pS474: HFPOFpSYSASIRE 5 & 50 il -- IB: HA 
Akt2 tail pS478: HEPQFSYSApSIRE 8 95 gS = 
Akt2 tail pS474/pS478: HFPQFpSYSApSIRE ao 0 [== IB: Flag 
d E f = 800 
: ~ EV I 
= 6004 * wr 
i= * } P<0.005 
2 400 aA 
s @ DE ] F < 0.005 
D 2 200 ' 
— 
5 
= 
5 10 15 20 
Time (d) 
= P <0.005 
es g 600 —— 
& 200 B P-<0.005 
€ — | x 
= B 400) ‘ x xEV 
5 100 E | t xe WT 
3 3 200] « t a AA 
2 E J { x DE 
3 Fog 
o 0 
ac Ev WT AA DE 
h Olfactory bulb Cerebellum Forebrain Tissue 
Aotl* Aptis Agvs Aptis apts Agvs ats pil ANA 
1212341234 1212341234 1212341234 # 4Mouse 
pS477/pT479 
——————— | IB: Akt-pS473 
ees | 1B: AK 
om —o ies tt ee yd?ye ty IB: cleaved 
pee caspase3 
eeeeee ~ " IB: cyclin A2 
po! bof! 
Se || em mm rrr er | IB: tubulin 
i j A2tt Cre-ER 


Ev DE AA pBabe-Myr-Akt1 


0 240 24 02 4 4-OHT (day) 


80 ——- ee IB: Akt-pS473 
= 66 a er i: Akt-pt308 
gS CE To 
= a : a1 
Oo 
g 40 — ee IB HA 
fo} 
a —s oo . 
<x 49 IB: cyclin A2 
i ee ee ae ae me MIB: tubulin 
) 
men obel Ghee Az!- 
= 4 = + = + 40OHT eS - -_ pa 
Ev DE AA Akt1 mutants Genotyping 


Figure 4 | Akt tail phosphorylation triggered Akt activation to promote Akt 
oncogenic functions. a, Stimulation of purified recombinant pAPH-Akt2-AC 
kinase activity by indicated Akt2 C-terminal tail peptides. pAPH-Akt2-AC, 
Akt2 residues 146-460, lacking the PH domain and C-terminal tail with T309 
phosphorylated. Experiments were performed in triplicate; data are shown 

as mean + s.d. No phos, no phosphorylation. b, mTOR in vitro kinase 

assays with degenerate peptide libraries as substrates. Experiments were 
performed in triplicate; data are shown as mean = s.d. c, Akt1-S477D/T479E 
mutation led to enhanced binding with Sin]. Immunoblot of WCLs 

and HA- or Flag-immunoprecipitations derived from Akt1-depleted 

HeLa cells transfected with indicated Akt1 constructs together with 
Flag-Sin1. d, e, Soft agar assays using Akt1-depleted HeLa cells stably 
expressing WT-, AA- or DE-Akt1. f, g, Akt1-depleted HeLa cells stably 
expressing WT-, AA- or DE-Akt1 were injected subcutaneously into nude 
mice (n = 10 for each group) and monitored for tumorigenesis (f). Tumours 
were dissected and weighed (g). *P < 0.05 (Student’s t-test). h, Immunoblot 
analyses of the indicated mouse brain tissues derived from mice with the 
indicated cyclin A2 genetic status. i, Fluorescence-activated cell sorting 
analysis of cyclin A2"' mouse embryonic stem cells stably expressing Cre-ER in 
the presence of 2 1g ml” tamoxifen for 4 days. j, Immunoblot analysis of 
WCLs derived from various indicated embryonic stem stable cell lines 
generated in i. 
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probably through enhanced interaction with mTORC2 (Supplementary 
Fig. 24a), thereby promoting in vitro anchorage-independent growth and 
in vivo tumorigenesis (Supplementary Fig. 24 b-f). 

To study the in vivo physiological significance of Cdk2/cyclin-A- 
mediated phosphorylation on Akt1-S477/T479 further, we generated 
brain-specific cyclin A2 knockout mice with nestin-Cre’’. Interest- 
ingly, a significant reduction in Aktl-pS477/pT479 was observed in 
cyclin A2“ olfactory bulbs coupled with elevated cleavage of caspase 3 
(Fig. 4h), suggesting that Aktl-pS477/pT479 might be critical for cell 
survival in olfactory bulbs. Consistently, acute ablation of cyclin A2 in 
cyclin A2"* mouse embryonic stem cells led to induced cellular apo- 
ptosis (Fig. 4i and Supplementary Fig. 25a, d), providing a possible 
explanation for previous findings that deleting cyclin A2 abolished 
embryonic stem cell colony formation in vitro'*. Furthermore, cyclin 
A2-deletion-induced elevation of cellular apoptosis in mouse embry- 
onic stem cells could be partly rescued by expressing Akt1-DE, but not 
Akt1-AA (Fig. 4i, j and Supplementary Fig. 25), supporting the idea that 
cyclin A2 may govern cellular survival in vivo largely by promoting Akt 
activation. 

To extend these findings and their clinical relevance to human path- 
ophysiology, we observed a positive correlation between Akt1-pS477/ 
p1T479 and Akt1-pS473 in samples from patients with breast cancer 
and breast-cancer-derived cell lines (Supplementary Fig. 26a-e). Inter- 
estingly, high levels of pS477/pT479 occurred at a relatively higher rate 
than pS473 in an earlier breast cancer developmental stage (stage II) 
(Supplementary Fig. 26d), indicating that pS$477/pT479 may serve as a 
better biomarker for early-stage breast cancer detection, although fur- 
ther investigations are warranted. 

Taken together, our data unravel a new phosphorylation event on 
Akt] at its extreme C-terminal residues, $477 and T479, to trigger Akt1 
activation either through enhancing the association between Akt] and 
mTORC2 to promote pS473, or by functionally compensating for 
pS473 to lock Akt1 in its active conformation. More importantly, our 
study directly couples Akt activity with cell cycle progression, two well- 
characterized hallmarks of human cancers***. In this regard, our data 
suggest that cyclin A2 overexpression might exert its physiological func- 
tions in part by directly phosphorylating and activating Akt to trigger its 
pro-survival and oncogenic functions. 


METHODS SUMMARY 


Expression plasmid constructs, proteins, antibodies, cell lines and the sequences of 
various short hairpin RNA (shRNA) oligonucleotides used in this study are des- 
cribed in Methods. All mutants were generated using mutagenesis PCR and the 
sequences were verified by DNA sequencing. Several pharmacological kinase inhi- 
bitors were used to screen for the kinases responsible for the Akt1-S477/T479 
phosphorylation. The in vivo Akt1-S477/T479 phosphorylation events were iden- 
tified by mass spectrometry and examined by in vitro kinase assays. The functional 
roles of Akt1-S477/T479 phosphorylation on Akt-S473 phosphorylation and kinase 
activity were examined either by co-immunoprecipitation or in vitro kinase assays. 
Cell viability assays were used to detect the Akt activity conferred by the Akt-S477/ 
T479 phosphorylation status. Soft agar and mouse xenograft assays were performed 
to analyse physiologically how the Akt1-S477/T479 phosphorylation contributes to 
elevated oncogenic activity to facilitate anchorage-independent growth and in vivo 
tumour growth. Immunohistochemistry on clinical samples from patients with breast 
cancer was performed to illustrate a significant positive correlation between Akt-pS473 
and Akt1-p$477/pT479. Brain-specific cyclin A2““ mice were generated by crossing 
cyclin A2"* mice with cyclin A2"* -nestin-Cre mice, described in detail in Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Nectar secretion requires sucrose phosphate 
synthases and the sugar transporter SWEET9 


I Winnie Lin'*, Davide Sosso'?, Li-Qing Chen’, Klaus Gase*, Sang-Gyu Kim’, Danny Kessler®, Peter M. Klinkenberg’*¥, 
Molly K. Gorder*+, Bi-Huei Hou”, Xiao-Qing Qu”’, Clay J. Carter", Ian T. Baldwin? & Wolf B. Frommer’? 


Angiosperms developed floral nectaries that reward pollinating insects’. 
Although nectar function and composition have been characterized, 
the mechanism of nectar secretion has remained unclear’. Here we 
identify SWEET9 as a nectary-specific sugar transporter in three 
eudicot species: Arabidopsis thaliana, Brassica rapa (extrastaminal 
nectaries) and Nicotiana attenuata (gynoecial nectaries). We show 
that SWEETS is essential for nectar production and can function as 
an efflux transporter. We also show that sucrose phosphate synthase 
genes, encoding key enzymes for sucrose biosynthesis, are highly 
expressed in nectaries and that their expression is also essential for 
nectar secretion. Together these data are consistent with a model in 
which sucrose is synthesized in the nectary parenchyma and subse- 
quently secreted into the extracellular space via SWEET9, where sucrose 
is hydrolysed by an apoplasmic invertase to produce a mixture of 
sucrose, glucose and fructose. The recruitment of SWEET9 for sucrose 
export may have been a key innovation, and could have coincided 
with the evolution of core eudicots and contributed to the evolution 
of nectar secretion to reward pollinators. 

Plants have evolved anatomical and physiological features for attract- 
ing animals to promote pollination. Reproductive isolation and thus 
speciation is thought to be enhanced in animal-pollinated species rela- 
tive to those reliant on wind pollination’. Floral traits, including animal 
pollination, can affect subsequent species abundance within clades’. 
When de Saporta, Hooker, Heer and Darwin discussed the “abomina- 
ble mystery”—the apparent rapid radiation of angiosperms and insects 
in the mid-Cretaceous period, de Saporta suggested that the develop- 
ment and refinement of insect-assisted pollination through the coevo- 
lution of pollinators and flowering plants may have been key to pollinator 
and angiosperm diversification*. Yet the molecular mechanism of nec- 
tar secretion has remained elusive’. 

Flowering plants evolved intricate methods to secure efficient inter- 
action with pollinators and, thereby, successful reproduction and genetic 
diversity through cross-pollination. Nectar, which contains sugars and 
volatile compounds that attract and reward pollinators’ and toxins that 
repel unwanted visitors®, compels pollinators to optimize outcrossing 
rates’. Nectar composition varies between species of plants, possibly to 
reward different animal species”. Depending on the plant species, sucrose, 
glucose and fructose make up 8-80% (w/w) of nectar. Angiosperm 
nectar is synthesized and secreted by different types of floral nectaries 
(FNs) and extrafloral nectaries (EFNs). N. attenuata (coyote tobacco), 
a self-compatible, hawkmoth- and hummingbird-pollinated asterid, 
produces nectar containing sucrose, hexoses and numerous secondary 
metabolites®*. B. rapa (turnip), comprising self-compatible and incom- 
patible varieties, produces hexose-dominant nectar®"®. A. thaliana, a 
self-compatible self-fertilizer, develops functional nectaries that produce 
volatiles and hexose-rich nectar?”””. Whether nectar production in self- 
fertilizing plants represents an evolutionary remnant or functions to 
secure the observed low outcrossing rates has yet to be established”. 


To identify transporters involved in nectar secretion, we searched for 
candidate sugar transporters specifically expressed in nectaries. SWEET 
sugar transporters appeared as prime candidates for a role in nectar 
secretion’*”*: for example, A. thaliana (At)SWEET11 and 12 are respon- 
sible for sucrose efflux, a key step in sucrose translocation from photo- 
synthetic tissues to heterotrophic tissues. A petunia SWEET9 homologue 
(PhNEC1; Extended Data Figs 1, 2) had been found in nectaries, and 
expression correlated inversely with nectary starch content’®; however, 
its role remained elusive until the Arabidopsis and rice homologues 
were shown to function as sugar transporters’. Here we show the crit- 
ical role of SWEET9 in nectar secretion by confirming its expression in 
nectaries, sucrose transport activity, and localization at the plasma mem- 
brane. Mutation of AtSWEET9 or nectary-expressed sucrose phosphate 
synthase (SPS) genes led to the loss of nectar secretion. Sugars delivered 
to defective nectaries accumulated in the stems at the floral base, indi- 
cating the lack of negative feedback on phloem delivery and the inability 
to relocate incoming sucrose. We demonstrate that SWEET9 is either 
conserved or has been independently co-opted in rosids (Arabidopsis, 
turnip) and an asterid (coyote tobacco) for nectar secretion. 

AtSWEET9, which shares ~50% identity with AtSWEET11 or 12 
(clade 3 of the SWEET family; Extended Data Figs 1, 2), is highly expressed 
in Arabidopsis nectaries’” (Extended Data Fig. 3). Transport studies in 
Xenopus oocytes showed that AISWEET9 mediates sucrose uptake and 
efflux (Fig. 1a, b). The sucrose transport activity of AtSWEET9 was con- 
firmed by coexpression of ASWEET9 with fluorescence resonance 
energy transfer (FRET) sucrose sensors in human embryonic kidney 
cells’, indicating that ASWEET9 functions as a facilitated diffusion 
transporter for sucrose’’. ASWEET9 weakly transports glucose; pos- 
sibly contributing to hexose efflux (Extended Data Fig. 4). 

To determine directly whether AtSWEET9 is involved in nectar secre- 
tion, we examined nectar in transfer DNA (T-DNA) insertion mutants. 
Depending on whether AtSWEET9 has a role in sugar uptake into or 
efflux from nectaries, nectar from mutants would either contain reduced 
sugar content or lose fluid secretion. Atsweet9 contained no detectable 
nectar droplets (Fig. 1c, d, Extended Data Fig. 5 and Supplementary 
Table 1). To test whether AtSWEET9 activity limits nectar secretion, 
extra AtSWEET9 copies expressed from the native promoter in wild- 
type plants increased nectar volume and glucose amount (Fig. le and 
Supplementary Table 2). Restoration of nectar secretion by AISWEET9 
or AtSWEET9 tagged with green fluorescent protein (GFP) in Atsweet9 
mutants further supports the role of SWEET9 in nectar secretion (Extended 
Data Fig. 5). 

SWEET9 could function in (1) sucrose efflux from phloem, (2) sugar 
uptake into, or (3) sugar efflux from nectary parenchyma. Translational 
B-glucuronidase (GUS) and enhanced (e)GFP fusions driven by the 
AtSWEET9 promoter were specifically expressed in nectaries (Fig. 2a—c), 
with the highest levels seen in the lower half of nectary parenchyma, 
but not in guard cells and phloem. The fluorescence intensity of 
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Figure 1 | AtSWEET9, a sucrose transporter, is necessary for nectar 
secretion. a, b, Sucrose uptake (a) and efflux (b) activity of AtSWEET9 

and BrSWEET9 were determined in Xenopus oocytes. Truncated 
AtSWEET9(F201*) (ASWEET9m) and BrSWEET9(L201*) (BrSWEET9m) 
served as negative controls. a, Oocyte uptake assay: ASWEET9 and 
BrSWEET9 mediate 4C-sucrose uptake (error bars show + standard error of 
the mean (s.e.m.), 1 = 14, n = 16). *P < 0.05, **P < 0.01, Student's ¢-test. 

b, Oocyte efflux assay: '*C-sucrose efflux by AtSWEET9 and BrSWEET9 in 
Xenopus oocytes injected with '“C-sucrose (error bars show + s.e.m., n = 9, 

n = 8).c, Nectar droplet clinging to inside of sepal (wild type: Col-0). d, Lack of 
nectar in nectaries of Atsweet9-1 mutants. e, Increased nectar secreted from 
nectaries of flowers containing extra copies of ASWEET9-eGFP, driven by the 
AtSWEET9 promoter (pAtSWEET9). c-e, Original magnification, x 10. 


AtSWEET9-eGFP in floral nectaries increased during maturation, becom- 
ing highest during maximal nectar secretion (Extended Data Fig. 6). 
These data do not support a role for AISWEET9 in phloem unloading, 
but do not enable us to distinguish between roles in uptake versus efflux 
from nectarial parenchyma. We conjectured that starch accumulation 
in Atsweet9 nectaries would be different if SWEET9 were involved in 
uptake (no starch accumulation in nectary) versus cellular efflux from 
nectarial cells (accumulation in nectary due to export deficiency). In wild- 
type plants, starch accumulates in plastids of nectary parenchyma before 
anthesis and is degraded to produce sugars for secretion’®’*. Mutants 
accumulated starch in all cells of the nectary parenchyma, indicating that 
AtSWEETS is responsible for cellular sugar efflux (Fig. 2d-g). Interest- 
ingly, at anthesis, nectarial guard cells of wild-type, but not Atsweet9 
plants, contained starch granules, possibly due to reabsorption of sugars 
from nectar’’. The functional characterization of ASWEET9 as a sucrose 
efflux transporter, the expression of SWEET9 in nectary parenchyma 
and the starch accumulation pattern in mutants intimate AtSWEET9 
as the key transporter for nectarial sugar secretion. The involvement of 
a putative facilitated-diffusion carrier indicates that high cytosolic levels 
of sugars in the nectarial parenchyma together with extracellular hydro- 
lysis of sucrose by apoplasmic invertase create the driving force for 
nectar secretion’. 

Whether sugar secretion is mediated by plasma membrane carriers 
or exocytosis has been a matter of debate””. AtSWEET9-eGFP fusions 
localized at the plasma membrane, the trans-Golgi apparatus and mul- 
tivesicular bodies (Fig. 2c, Extended Data Fig. 7 and Supplementary 
Videos 1-3). Given the strength of the SWEET9 promoter, accumula- 
tion in the trans-Golgi/multivesicular bodies is probably caused by high 
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Figure 2 | Cellular and subcellular localization of AtSWEET9 and starch 
accumulation in Atsweet9 mutants. a, b, Histochemical GUS analysis in 
Arabidopsis flowers expressing translational GUS fusion of AtSWEET9 (native 
promoter). Vertical (a) and transverse (b) sections of Arabidopsis flowers show 
tissue-specific localization of ALSWEET9. Cell walls stained with safranin-O 
(orange). c, Confocal images of eGFP fluorescence of an AtSWEET9 
promoter:AtSWEET9-eGFP fusion showing subcellular localization at the 
plasma membrane and trans-Golgi network. d, e, Close-up of nectaries from 
wild-type and Atsweet9-1 flowers stained with Lugol’s iodine solution. Starch 
accumulated only in the guard cells of wild-type nectaries and in nectary 
parenchyma in Atsweet9-1 (sampled at the end of the dark period). f, g, LR 
White resin sections of Arabidopsis nectaries in wild type and Atsweet9-1 
mutants stained with Lugol’s iodine solution. Starch grains (dark red) 
accumulate in nectaries of Atsweet9-1 mutants (g) and in stomata of wild-type 
nectary (f, indicated by an asterisk). Starch grains in floral stalks and nectaries 
in wild-type and Atsweet9 mutant lines at anthesis. Cell walls stained with 
safranin-O (orange). 


expression levels”. Thus, although it is conceivable that ASWEET9 
loads vesicles with sugars for exocytosis, plasma-membrane-mediated 
efflux is more likely. Given that multiple clade 3 SWEETs transport 
sucrose, we explored whether the well-characterized plasma-membrane- 
localized paralogues AtSWEET11 and 12 (ref. 15) can complement the 
loss of nectar secretion in Atsweet9. When expressed from the AtSWEET9 
promoter, both transporters restored nectar secretion (Extended Data 
Figs 5g, h, 6j-k and Supplementary Video 4), indicating that a sugar 
efflux transporter driven by a promoter that expresses it in nectaries 
suffices for sugar efflux. 

Gene expression analysis in Arabidopsis has shown that sucrose bio- 
synthesis genes are upregulated in nectaries”’, intimating that resynthesis 
of sucrose derived from starch fuels sugar efflux. We found that two 
SPS genes, SPS1F and SPS2F, which encode key enzymes for sucrose 
biosynthesis, were highly expressed in maturing nectaries (Extended 
Data Fig. 8). Artificial microRNA (amiRNA) inhibition of SPS gene 
expression led to loss of nectar secretion and increased starch accumula- 
tion (Fig. 3a—d). Thus, the phenotypes of Atsweet9 mutants, SPS amiRNA 
lines and the nectarial apoplasmic invertase mutant Atcwinv4-1 (ref. 22) 
are similar. The most parsimonious hypothesis is thus that starch-derived 
sucrose synthesized in nectaries is exported by SWEET9, and that sucrose 
hydrolysis by CWINV4 is necessary to create a large enough osmotic 
gradient to sustain water secretion (Fig. 3e). Species secreting nectar 
with a very high sucrose content may use additional mechanisms for 
creating a sufficient osmotic gradient. 

To explore whether SWEET9 of other Brassicaceae is essential for sugar 
efflux from nectaries, we identified the orthologous B. rapa (Br)SWEET9 
in turnip flowers and showed that it transports sucrose (Fig. 1a, b) and is 
essential for sugar efflux and nectar secretion (Fig. 4a, b and Extended 
Data Fig. 9a). Nectar from the rosids Arabidopsis and turnip contains 
predominantly hexoses’, consistent with the role of apoplasmic inver- 
tase in post-secretory sucrose hydrolysis. Other rosids may also use 
SWEET for nectar secretion; for example, microarray data indicate that 
SWEET9 has a role in nectar secretion from extrastaminal nectaries as it 
is the highest expressed SWEET in barrel clover (Medicago truncatula) 
flowers (Extended Data Fig. 10). 
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Nectaries probably evolved multiple times”. Solanales (asterids) have 
gynoecial nectaries, whereas Brassicales typically have extra- and intra- 
staminal nectaries. To identify potential functional orthologues from 
coyote tobacco, SWEETs were cloned using degenerate primers (Sup- 
plementary Table 3). N. attenuata (Na)SWEET9 was most highly expressed 
in gynoecial nectaries; levels increased during nectary maturation (Fig. 4c). 
NaSWEET9 mediated sucrose uptake and efflux when expressed in oocytes 
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Figure 4 | SWEET9 orthologues in B. rapa and N. attenuata are essential for 
nectar secretion. a, Nectar droplets in lateral nectary of wild-type (R-o-18) B. 
rapa flowers. b, Lack of nectar in Brsweet9-1 mutants. c, NaSWEET9 transcript 
accumulation in N. attenuata (error bars show + s.e.m., n = 3). d, Mean 

(+ s.e.m., n = 8) nectar volume of flowers measured at 05:00 in wild-type, 
Nasweet9-1 and Nasweet9-2 plants. e, f, Sucrose uptake (e) and efflux (f) activity 
of NaSWEETS9 in oocytes. Truncated version of NaSWEET9(L201*) 
(NaSWEET9m) served as control. e, Oocyte uptake: NaSWEET9 mediates 
4C_sucrose uptake (errors bars show + s.e.m., n = 14). £,'4C-sucrose efflux by 
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Figure 3 | SPS1 and SPS2 are necessary for nectar 
secretion in Arabidopsis. a,b, amiRNA inhibition 
of AtSPS1 and AtSPS2 gene expression led to a loss 
of nectar secretion. The circles indicate nectar 
droplets in wild-type flowers (a), and the absence 
of nectar in mutant flowers (b). c, d, Starch 
accumulation in nectaries of Atsps miRNA lines 
(floral stalk of sps1f/2f mutant lines, red arrow; only 
in guard cells of wild-type nectaries). Original 
magnification, X10. e, Model for nectar secretion: 
starch-derived sucrose is synthesized in nectaries 
by SPS proteins and exported by SWEET9. 
Exported sucrose is subsequently hydrolysed by 
CWINV4, which creates a high osmotic potential 
to sustain water flow down the osmotic gradient. 
Frc6P, fructose 6-phosphate; UDPeglc, uracil 
diphosphate glucose. 
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(Fig. 4e, f). NaSWEET9 was essential for nectar secretion as shown in 
two independent RNA interference (RNAi) lines (Fig. 4d and Extended 
Data Fig. 9). Together, our results indicate that SWEET9 either arose 
early during core eudicot evolution or has been co-opted independently 
in asterids and rosids for gynoecial and both intra- and extrastaminal 
nectary function. A phylogenetic analysis tentatively traces the appear- 
ance of SWEET9 to a time window nearly at the origin of core eudicots 
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NaSWEET9 in oocytes (error bars show = s.e.m., n = 10). **P<0.01, 
Student’s t-test. g, Data were collected from genome databases 
(http://phytozome.org; http://genomevolution.org; http://bioinformatics. 
psb.ugent.be/plaza/) using AtSWEET9 as bait. Tree branches are a schematic 
representation (Angiosperm Phylogeny Group III system). Species belonging 
to the core eudicot clades of rosids or asterids are underlined in red and 
green, respectively; full names are listed in Supplementary Table 5. Species 
bearing floral (F), extrafloral (EF) and septal (S) nectaries are indicated. 
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~120 million years ago** (Extended Data Figs 1, 2). In all genomes 
analysed, including grasses, Selaginella and Physcomitrella, we found 
multiple SWEET paralogues. Thus SWEET9 may have evolved from 
other clade 3 members when core eudicots merged. SWEET9 seems 
to be absent from wind-pollinated rice and maize lacking nectaries, 
banana, which has a typical monocot septal nectary, and Amborella, a 
member of an early angiosperm lineage. Aquilegia is a highly derived 
genus within the Ranunculales at the base of the eudicots” and relies 
on entomophilous pollination mediated by elaborate nectar-filled nec- 
tar spurs. However, its genome seems to lack SWEET9 orthologues 
(Phytozome search, October 2013). The hypothesis of a single origin 
of SWEETS during the evolution of core eudicots is consistent with 
conclusions from analyses of the master regulator of nectary develop- 
ment CRABS CLAW”. 

Castor bean and cassava, which produce nectar from EFNs, also con- 
tain SWEET9 homologues” (Fig. 4g and Extended Data Figs 1, 2). We 
found that the wind-pollinated poplar, which lacks functional FNs but 
has foliar EFNs, contains SWEET9 and SWEET 10 homologues. The 
sucrose transporter Populus trichocarpa (Pt)SWEET 10a (Extended Data 
Fig. 4) is highly expressed in EFNs”*, indicating that the foliar EFN uses 
SWEET9 paralogues for nectar secretion. Poplar nectar is hexose dom- 
inant, potentially due to cell wall invertases, which are highly expressed 
in the EFN**. These EFNs probably have a role in attracting ants as a 
defence against other insects”. 

Starch accumulation in floral stalks of the mutants indicates that 
phloem-derived sucrose is imported into nectaries symplasmically. Sucrose 
is then hydrolysed and stored as hexoses in the vacuole, or as starch’®. 
During nectary maturation, sucrose is resynthesized via SPS, and SWEET9 
begins to export sucrose down a concentration gradient, leading to 
sucrose accumulation in the apoplasm (Fig. 3e). As SWEET9 seems to 
function as a uniporter, and because the cytosol contains other solutes 
that contribute to the osmotic potential, uniporter-driven efflux is insuf- 
ficient for osmotically driven water secretion. Thus, apoplasmic sucrose 
has to be hydrolysed by secreted invertases to produce glucose and 
fructose, increasing the osmotic driving force and allowing water to be 
secreted. Ultimately, sugary nectar is secreted through open stomata. 
Together, our results show that SWEET9 serves as a sugar efflux trans- 
porter of the nectary parenchyma and is necessary for the secretion of 
nectar in two types of eudicot nectaries. Microarray data show that 
hexose-transporting sugar transporters (STPs) are expressed in nectaries””, 
indicating that they have a role in reuptake. The relative activities of 
apoplasmic invertase combined with selective reuptake possibly deter- 
mine the final ratio of sucrose, fructose and glucose. 


METHODS SUMMARY 


Heterologous expression of SWEET9 from in HEK293T cells and Xenopus oocytes 
was performed as described'*. Mutant Arabidopsis was obtained from the Arabidopsis 
Biological Research Center (ABRC), insertion sites and transcript levels were verified 
by PCRand qRT-PCR, respectively. BrSWEET9 TILLING mutants were obtained 
from RevGen UK. NaSWEET9 was silenced in N. attenuata by RNAi (inverted 
repeat constructs). SPSIF and SPS2F were co-silenced through a single amiRNA 
targeting the messenger RNAs for both genes. Nectar secretion was evaluated micro- 
scopically. Starch was stained using potassium iodide”. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Structure of a type IV secretion system 


Harry H. Low!*+, Francesca Gubellini**, Angel Rivera-Calzada', Nathalie Braun’, Sarah Connery’, Annick Dujeancourt?, Fang Lu, 
Adam Redzej', Rémi Fronzes’, Elena V. Orlova' & Gabriel Waksman! 


Bacterial type IV secretion systems translocate virulence factors into 
eukaryotic cells’”, distribute genetic material between bacteria and 
have shown potential as a tool for the genetic modification of human 
cells’. Given the complex choreography of the substrate through the 
secretion apparatus’, the molecular mechanism of the type IV secre- 
tion system has proved difficult to dissect in the absence of structural 
data for the entire machinery. Here we use electron microscopy to 
reconstruct the type IV secretion system encoded by the Escherichia 
coli R388 conjugative plasmid. We show that eight proteins assemble 
in an intricate stoichiometric relationship to form an approximately 
3 megadalton nanomachine that spans the entire cell envelope. The 
structure comprises an outer membrane-associated core complex’ 
connected by a central stalk to a substantial inner membrane com- 
plex that is dominated by a battery of 12 VirB4 ATPase subunits 
organized as side-by-side hexameric barrels. Our results show a 
secretion system with markedly different architecture, and conse- 
quently mechanism, to other known bacterial secretion systems’*°. 

The canonical type IV secretion (T4S) system comprises 12 proteins, 
VirB1-11 and VirD4, and forms a large macromolecular complex that 
spans the cell envelope of Gram-negative bacteria”. The hub protein 
VirB 10 inserts into both the inner and outer membranes and spans the 
entire width of the periplasm. It is decorated by VirB7 and VirB9 in a 
1:1:1 ratio to form a C,4 symmetrized outer membrane pore termed the 
core complex’. The architecture and relative topological positioning of 
the rest of the T4S system components, particularly at the inner mem- 
brane, are unknown. Three ATPases, VirB4, VirB11 and VirD4, energize 
the secretion apparatus’**. Some T4S systems are known to extend a 
substantial tubular pilus, comprising VirB2 supplemented with a minor 
component, VirB5, beyond the cell boundary’. 
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To provide structural insights into an assembled T4S system, the 
virB/trw operon of the R388 plasmid from virB1/trwN to virB10/trwE 
(T4SS,_19) was cloned, overexpressed in E. coli and the solubilized mem- 
brane fraction subjected to affinity chromatography (Extended Data 
Fig. 1a). Analysis of eluted sample by SDS-polyacrylamide gel electro- 
phoresis (SDS-PAGE) (Fig. 1a), liquid chromatography mass spectro- 
metry (LC-MS) and western blot (Extended Data Fig. 1b) reproducibly 
showed a purified complex consisting of VirB3/TrwM, VirB4/Trwk, 
VirB5/TrwJ, VirB6/Trwl, VirB7/TrwH, VirB8/TrwG, VirB9/TrwF and 
VirB10/TrwE (termed the T4SS3_ 19 complex). Overexpression of virB1/ 
trwN to virB11/trwD with virB10/trwE tagged as in the T4SS;_j9 clone 
yielded the same T4SS3_;9 complex. 

To characterize the three-dimensional architecture of the T4SS3_15 
complex, a negative-stain electron microscopy data set was collected. 
Characteristic views of the complex (class averages) were obtained that 
typically showed double barrel-like densities connected by a thin cent- 
ral stalk to a structure clearly reminiscent of the pKM101 plasmid core 
complex’ (Fig. 1b). Indeed, separate purification of R388 VirB7/TrwH, 
VirB9/TrwF and VirB10/TrwE, and negative-stain electron microscopy 
of the resultant complex, confirmed formation of a pKM101-like core 
complex complete with 14-fold symmetry (Extended Data Fig. 2). In 
the T4SS3_19 assembly, those densities outside the obvious core com- 
plex are broadly termed the inner membrane complex (IMC) (Fig. 1b). 
Significant flexibility between IMC and the core complex was observed 
(Extended Data Fig. 3a-c), which prompted two discrete three-dimensional 
reconstructions to be generated for the two complexes (Extended Data 
Fig. 3d). Final resolutions of 18 A and 23 A were achieved, respectively 
(Extended Data Fig. 4a), with flexibility in the IMC accounting for the 
difference in resolutions. Ultimately, these reconstructions were merged 


Figure 1 | Purification of the R388 
4 encoded T4SS3_19 complex. a, SDS- 
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Figure 2 | Asymmetric composite 
= structure of the T4SS3_19 complex. 
a, Front view. The map is a composite 
generated by merging independently 
processed core complex and IMC 
reconstructions. b, Cut-away front 
view. Electron density is colour- 

a coded, ranging from red to blue, 
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to generate a composite map of the entire T4SS3_) 9 complex (Figs 2 and 
3a and Extended Data Fig. 4b-e). Final maps were generated with no 
symmetry applied. However, C,4 symmetry is clearly observed in the core 
complex and overall C, symmetry in the IMC (Fig. 2). 

The whole T4SS3_ 1) complex is 340 Ain length and comprises a core 
complex connected by a central stalk to an IMC that is 255 A at its 
widest (Fig. 2). The observed core complex is similar to that of the well- 
characterized pKM101 plasmid”®, with a diameter of 185 A and rota- 
tionally symmetric inner (I-) and outer (O-) layers integrated to form 
a chamber that is divided by a central middle platform (Fig. 3b). The 
core complex cap is in an overall closed conformation as in the cryo- 
electron microscopy reconstruction of the pKM101 core complex’, with 
any central constriction’ probably occluded by the negative stain of 
bound detergent or lipid, or both. 

In contrast to previous T4S system core-complex reconstructions 
the inner (I-) chamber is partly occluded by the central stalk (Fig. 3b-d), 
which extends distally from the core complex to form a central binding 
hub within the IMC. The stalk does not show any clear symmetry, and 
in the composite T4SS3_ 1, complex model it makes several limited con- 
tacts with the walls of the I-chamber. The top and middle parts of the 
stalk are linked by four connections that varied in intensity during refine- 
ment, with the central pair being most stable (Fig. 3c and Extended Data 
Fig. 5a). Other connections may exist that were not observed owing to 
marked flexibility in this region (Extended Data Fig. 3b, c). 
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Figure 3 | Segmentation of the T4SS3_19 complex reconstruction. a, Side, 
front and bottom views. Of the two barrels, only the left one is segmented. The 
colour scheme used is upheld in all panels. b, Zoom cut-away view of the core 
complex and stalk. Dashed red line delineates the border at which the separate 
core complex and IMC reconstructions were merged (left). Central cross- 
section schematic of the core complex from this study (right). c, Zoom side view 
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Connected to the stalk distal end are two barrel-like densities related by 
C, symmetry with length 134 A and minimum diameter 105 A (Fig. 2). 
Each barrel comprises three tiers, with the lower and middle tiers con- 
stituting rings with three-fold symmetry oriented around a central 
channel. In addition, a dimeric arrangement of densities around each 
three-fold axis is observed in the middle tier (Fig. 3e). The upper tier is 
more substantial and any central channel appears closed. Overall, each 
barrel comprises three basic elongated segments that are each made up 
of two subunits (Extended Data Fig. 5b). 

Directly above the upper tier of each barrel lies a remarkable struc- 
ture termed the arch, which interconnects between barrel subunits and 
the central stalk (Figs 2 and 3d). Each arch comprises a substantial central 
density that contacts the underlying barrel through up to six thin linkers 
(Extended Data Fig. 5a). Note that not all linkers were consistently 
resolved during refinement and often only a dominant triad of links 
was observed (positions 1, 3 and 5 in Extended Data Fig. 5a). 

To ensure that dry preparation of samples did not affect the integrity 
of the particles, a tilt series experiment using cryo-negative-stain elec- 
tron microscopy was performed. It confirmed the features and dimen- 
sions of the T4SS3_ 1. complex reconstruction described above (Extended 
Data Fig. 6). 

To begin dissecting the internal organization of the T4SS3_1) com- 
plex, the stoichiometry of its constituents was determined using radio- 
iodination". A T4SS3_;9 complex clone variant was used in which a 


Cap 


Front Cross-section 


Bottom 


of the stalk. Some of the linkers between the core complex and IMC are flexible 
and were therefore poorly resolved. d, Cut-away top view of the stalk and 
arches. e, Each barrel-like density consists of three dimeric elongated segments. 
Cross-sections of the lower and middle tiers (right panel) show three-fold 
symmetry with a trimer of dimeric densities present in the middle tier. 
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His-tag was incorporated at the amino (N) terminus of VirB6/TrwI 
(T4SS3_jo/His6-Bo)- The T4SS3_io/His6-36 Complex showed improved 
biochemical stability and was equivalent to the T4SS3_1) complex when 
analysed by electron microscopy (Extended Data Fig. 7a). Using the 
known 14-fold copy number of VirB10/TrwE””* for calibration, the 
relative stoichiometry of VirB4/TrwK, VirB6/TrwI and VirB5/TrwJ 
is 11.5:23.8:11.9 (Fig. 4a). Deconvolution of overlapping iodine-125 
signals for VirB9/TrwF and VirB8/TrwG, based on a copy number 
of 14 for VirB9/TrwF””’, suggests there are 12.6 copies of VirB8/TrwG 
(Fig. 4a). VirB7/TrwH has no tyrosine residue but its stoichiometry 
within the core complex is known and is the same as VirB10/TrwE. 
As radio-iodination was unsuitable for VirB3/TrwM owing to a low 
iodine-125 signal, VirB4/TrwK and VirB3/TrwM were co-expressed 
independently and purified as a distinct complex from the membrane 
fraction. Gel filtration and SDS-PAGE showed that these proteins co- 
elute (Fig. 4b and Extended Data Fig. 7b). Quantification of the respec- 
tive bands showed a stoichiometry of 1:1 (Fig. 4b). Strikingly, it therefore 
seems that all constituents of the T4SS3_;, complex, excluding those of 
the core complex, essentially exist in multiples of 12. The combined 
stoichiometry data suggest the T4SS3_19 complex is approximately 3.4 
MDa, which correlates well with the calculated mass estimation derived 
from the map volume (~3.0 MDa). 

The stacked three-tier architecture of the IMC barrels is reminiscent 
of the hexameric electron microscopy reconstruction of VirB4/TrwK”’. 
Indeed, gold labelling of the T4SS3_1) complex with a His-tag incorporated 
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Figure 4 | Stoichiometric analysis and localization of various 
macromolecular components within the T4SS3_;) complex reconstruction. 
a, Iodine-125 labelling of T4SS3_10/11is6-n6 complex constituent proteins. Left, 
SDS-PAGE of the T4SS3_10/tis6-36 Complex. * indicates minor contaminants 
(OmpF/OmpA and Lpp). Right, SDS-PAGE of '”°I-labelled proteins in left 
lane (Coomassie) and corresponding radiograph in right lane (iodine-125 
labelling). Relative stoichiometry was calculated by integration of band 
intensity and is shown at right. Reported means and corresponding standard 
deviations are from two separate labelling experiments on four independent 
purifications. b, SDS-PAGE of the VirB4/TrwK and VirB3/TrwM complex 
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at the carboxy (C) terminus of VirB4/TrwK (T4SS3_10/ B4-Hise) Shows 
localization of label to both IMC barrels (Fig. 4c and Extended Data 
Fig. 8a, c). Negative-stain electron microscopy of VirB4/TrwK purified 
from the membrane fraction or as an N-terminal maltose-binding pro- 
tein (MBP) fusion shows an elongated and flexible monomer with 
approximate length 105 A (Extended Data Fig. 7c-e), which is consistent 
with the internal length ofan IMC barrel. Such data, combined with the 
VirB4/TrwkK stoichiometry and the observed trimer of dimer symmetry 
within each IMC barrel, support a model in which each barrel contains 
six VirB4/TrwK monomers. Gold labelling of VirB6/TrwI (T4SS3_j0/11is6-B6 
complex) shows a similar localization pattern to that of VirB4/TrwK, 
which suggests the VirB4/TrwK C terminus and VirB6/TrwI N terminus 
are in the same region within the complex (Fig. 4d and Extended Data 
Fig. 8b, c). 

To deduce the position of the inner membrane, the relationship be- 
tween VirB4/TrwkK and the inner membrane was investigated. During 
T4SS3_10/p4-Hiss complex purification, sequential washing of the mem- 
brane fraction with 6 M urea, high pH and high salt solutions failed to 
separate VirB4/TrwK (Extended Data Fig. 7f), which suggests robust 
insertion of VirB4/TrwK within the inner membrane. Incorporating 
data from previous studies'*”, it is proposed that the lower and middle 
tiers of each VirB4/TrwK barrel house the cytoplasmic VirB4/TrwK 
C terminus, whereas the upper tier is either partly or wholly inserted 
within the inner membrane and houses the VirB4/TrwK N terminus. 
Importantly, the soluble VirB4 C-terminal ATPase domain from 
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stained with SYPRO Ruby. Relative stoichiometry was calculated by integration 
of band intensity and is shown at right. Reported means and corresponding 
standard deviations are from two independent purifications. c, Gold labelling 
(5nm) of VirB4/TrwkK clusters around the IMC barrels. Scale bar, 10 nm. 

d, Gold labelling (5 nm) of VirB6/TrwI shows a similar localization pattern to 
that of VirB4/Trwk (see c). Scale bar, 10 nm. e, Fit of the VirB4 ATPase domain 
from T. pseudethanolicus (4AG5), the pkKM101 outer membrane complex 
(3JQ0) and the in silico model of VirB9/TraO (3ZB)J). f, Summary schematic 
showing the localization of known components and position of cell 
membranes. For clarity, only VirB nomenclature is used in the colour key. 
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Thermoanaerobacter pseudethanolicus fits optimally as a trimer of 
dimer arrangement within the lower and middle tiers (Fig. 4e)'®. 

The compelling fit of the pXM101 outer membrane complex struc- 
ture’ and in silico VirB9/TraO I-layer model’ into the core complex of 
the T4SS;_1 reconstruction (Fig. 4e) highlights the unexpected distance 
(~40 A) between the I-layer base and the proposed position of the 
inner membrane (Fig. 4f). The VirB10/TrwE N termini, extending 
distally from the base of the I-layer, either as part of the stalk or as 
distinct strands, must span this space to insert into the inner mem- 
brane (Extended Data Fig. 9a). This architecture suggests that substrate 
could also access the core complex secretion chamber from within the 
periplasm. Such a model is compatible with some T4S systems thought 
to use a two-step process’>’®. Substrate would first cross the inner mem- 
brane using either the VirD4/TrwB coupling protein, VirB4/TrwkK, or 
the Sec machinery in some other T4S systems. Then, secondarily, sub- 
strate would be channelled into the core complex and extruded (Extended 
Data Fig. 9b). Evidence is also mounting for interaction between VirD4/ 
TrwB, VirB4/TrwK and VirB11/TrwD”. VirB4/TrwK hexamers could 
recruit, or even be replaced by, VirB11/TrwD or VirD4/TrwB, which 
would allow different enzymatic and functional gearing of the T4S ma- 
chinery during the secretion cycle. 

When compared with the type III secretion (T3S) system’* (the only 
known structure for an assembled secretion system; Extended Data 
Fig. 10), the T4S system shows a fundamentally different architecture. 
While T3S systems are organized as a series of integrated ring-like 
structures that form a continuous tubular conduit, T4S systems show 
a more modular design with a two-fold symmetric inner-membrane 
complex conjoined by a central stalk to a concentric outer membrane 
channel. Thus T4S systems represent a radically different evolutionary 
approach, design and mechanism for the translocation of substrate 
across the bacterial cell envelope. 


METHODS SUMMARY 


The region from virB1/trwN to virB10/trwE (T4SS,_10) was cloned from the E. coli 
R388 plasmid into pASK3c and pBAD vectors with a StrepII tag encoding sequence 
at the 3’ terminus of virB10/trwE. Constructs T4SSj_19/p4-tiss and T4SS}_10/t1is6-B6 
derived from pBAD:T4SS,_;9 had His6-tags inserted at the C and N termini of 
VirB4/TrwK and VirB6/TrwIl, respectively. The pASK3c:T4SS3_4 construct was 
used to express the VirB3/TrwM-VirB4/TrwK complex. The T4SS3_19 complex 
was extracted from bacterial membranes by mild solubilization using a detergent 
mixture of 0.5% w/v n-dodecyl-B-p-maltopyranoside (DDM), 0.75% w/v Decyl 
Maltose Neopentyl Glycol (DM-NPG) and 0.1% w/v digitonin, and purified by the 
StrepII tag. Gold labelling of the T4SS3_19/p4-1ise6 aNd T4SS3_j0/14is6-n6 Complexes 
used 5nm diameter Ni-NTA nanogold beads. Stoichiometry was determined by 
iodine-125 incorporation in tyrosine residues using purified T4SS3_jo/t1is6-B6- Samples 
were stained with uranyl acetate for negative-stain electron microscopy, or ammo- 
nium molybdate for cryo-negative-stain electron microscopy, on carbon-coated 
copper grids. Grids were examined using mostly a Tecnai F20 FEG microscope 
operating at a voltage of 200 kV. Images were recorded on a Gatan UltraScan 4000 
CCD camera. For the T4SS3_19 complex, a total of 21,680 particles were selected 
manually using BOXER”. Contrast transfer function (CTF) was estimated and cor- 
rected using CTFFIND3 (ref. 20) and BSOFT™", respectively. Particle alignment steps, 
multivariate statistical analysis (MSA), angular reconstitution, three-dimensional 
reconstruction and structure refinement were performed using IMAGIC” and 
SPIDER™. The core complex/stalk and IMC regions were then separately pro- 
cessed to generate two discrete maps. The two map densities were adjusted to 
equivalent standard deviation and then overlaid using the core complex I-layer 
as reference for superimposition. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature13280 


Corrigendum: Rapid 
remobilization of magmatic 
crystals kept in cold storage 


Kari M. Cooper & Adam J. R. Kent 


Nature 506, 480-483 (2014); doi:10.1038/nature12991 


In Fig. 1 of this Letter, a drafting error meant that the labels for the ver- 
tical axis were offset by one order of magnitude (for example, the tick 
mark labelled ‘10° should have been labelled ‘10”). Figure 1 has been 
corrected online. 
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CORRIGENDUM 
doi:10.1038/nature13299 


Corrigendum: Cllorf95-RELA 
fusions drive oncogenic NF-«B 
signalling in ependymoma 

Matthew Parker, Kumarasamypet M. Mohankumar, 
Chandanamali Punchihewa, Ricardo Weinlich, 

James D. Dalton, Yongjin Li, Ryan Lee, Ruth G. Tatevossian, 
Timothy N. Phoenix, Radhika Thiruvenkatam, Elsie White, 
Bo Tang, Wilda Orisme, Kirti Gupta, Michael Rusch, 

Xiang Chen, Yuxin Li, Panduka Nagahawhatte, Erin Hedlund, 
David Finkelstein, Gang Wu, Sheila Shurtleff, John Easton, 
Kristy Boggs, Donald Yergeau, Bhavin Vadodaria, 

Heather L. Mulder, Jared Becksfort, Pankaj Gupta, 

Robert Huether, Jing Ma, Guangchun Song, Amar Gajjar, 
Thomas Merchant, Frederick Boop, Amy A. Smith, Li Ding, 
Charles Lu, Kerri Ochoa, David Zhao, Robert S. Fulton, 
Lucinda L. Fulton, Elaine R. Mardis, Richard K. Wilson, 
James R. Downing, Douglas R. Green, Jinghui Zhang, 
David W. Ellison & Richard J. Gilbertson 


Nature 506, 451-455 (2014); doi:10.1038/nature13109 


In this Article, author Jared Becksfort’s surname was incorrectly 
spelled ‘Becksford’. Also, his affiliation should be the Department of 
Computational Biology and Bioinformatics (not the Department of 
Pathology) at St Jude Children’s Research Hospital. His name and 
affiliation have been corrected online. 
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Funder storm 


A confluence of budget disappointments has confounded 
US scientists and left many uncertain about their future. 


BY HELEN SHEN 


| ast May, Heidi Fisher feared that her 


career as an evolutionary geneticist might 
be over. Although she was a postdoc in 
a large, successful lab at Harvard University in 
Cambridge, Massachusetts, and had numerous 
publications under her belt, she was struggling 
to secure a faculty position ata university where 


she could continue her studies on adaptive 
mating strategies in deer mice. 

Her 2012 application for a K99 grant from 
the US National Institutes of Health (NIH) in 
Bethesda, Maryland — funding that would 
help her to establish an independent lab — 
scored well by a typical year’s standards. But 
looming spending cuts in 2013 and a highly 
uncertain financial outlook pushed back the 
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NIH’s payline (the score separating grants 
that will be funded from ones that fall short), 
bumping Fisher’s application into a bal- 
looning ‘maybe’ category, to be decided on 
once the agency could fully assess its budget 
prospects. 

After a nail-biting, half-year delay, Fisher 
heard that she would not receive the grant. 
Crushed, she considered seeking a job in bio- 
technology, education, legal services or even 

art. “I thought, ‘IfI can't get a grant now, am 

I going to be struggling like this my whole 

career?,” she says. 

Then, in an unexpected turn of events a 
month later, she learned that the NIH had 
found a small amount of extra funding to sup- 
port her award. She has now been offered a fac- 
ulty position at a leading US research university. 

Such twists and turns have become more 
common across the United States as a result 
of drawn-out budget disruptions. First came 
across-the-board government spending cuts 
known as sequestration — the result of Con- 
gress’s failure to agree on a deficit-reduction 
strategy — which lopped 5% from science 
agencies’ budgets on 1 March 2013. Then, 
in October, partisan budget battles forced a 
16-day government shutdown (see Nature 502, 
13-14; 2013) that suspended nearly all feder- 
ally funded science programmes, with dire 
consequences for some time-sensitive research. 

The outlook has brightened somewhat, 
however: in January, Congress released a 2014 
spending deal that suspended sequestration for 
two years and slightly boosted the budgets for 
the US National Science Foundation (NSF) in 
Arlington, Virginia, and the US Department 
of Energy (DOE) Office of Science over 2012 
levels. The NIH, however, took a US$800-mil- 
lion cut, continuing a ten-year downward trend 
(see Nature 505, 461-462; 2014). 

But some observers worry that last year’s 
trauma will continue to influence the attitudes 
and decisions of scientists in training for some 
time. “A graduate student who looks at their 
principal investigator and sees how much trou- 
ble they’re having is discouraged from going out 
into academia and setting up their own lab,’ says 
Laurie Glimcher, dean of Weill Cornell Medical 
College in New York City. “I'm sure we're going 
to lose some of the next generation of scientists.” 


GRIM STATISTICS 

Sequestration slashed funding for the NIH by 
$1.55 billion, and grant-application success 
rates fell to a historic low of 16.8%, accord- 
ing to Sally Rockey, NIH deputy director > 
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> for extramural research. In an analysis last 
month of R-series grants — the most common 
type of NIH research grant — the American 
Society for Biochemistry and Molecular Biol- 
ogy in Rockville, Maryland, reported that the 
agency funded 1,000 fewer people last year, a 
4% drop from the previous year: the number 
dropped by only 150, or 0.5%, between 2011 
and 2012. “Not getting funded for one year is 
not necessarily the end of a career,” says the 
society's president, Jeremy Berg, “but some 
people are at the end of their rope.” 

Yuntao Wu, an HIV researcher at George 
Mason University in Manassas, Virginia, ran 
out of NIH funding in April 2013 after two 
failed attempts to renew his RO1 grant. He 
laid off his only technician, and for the past 
year has been running his lab on about $3,000 
per month, scraped together from university 
funds, industry contributions and a private 
fundraising campaign. The budget barely cov- 
ers the cost of keeping his cell cultures alive 
and conducting a few small experiments, says 
Wu. He submitted around 14 applications to 
the NIH last year, and is optimistic that at least 
one will be successful. But he worries about 
the lasting impressions of the past year on his 
graduate students, whose research has slowed 
to acrawl. “They saw me struggle. They saw 
the lab really struggle,” he says. 


BUMPY RIDE 

NIH-funded researchers were not alone in suf- 
fering the sting of sequestration. At the NSE, 
690 fewer grants were awarded in 2013 — down 
6% from 2012 — according to a report released 
by the US Government Accountability Office 
last month. 

The physical sciences have seen some 
recovery after the initial shock in March 2013. 
Sequestration had threatened to close or delay 
construction on several large-scale physics 
initiatives, including the Alcator C-Mod mag- 
netic fusion experiment at the Massachusetts 
Institute of Technology in Cambridge (see 
Nature 487, 420; 2012). But in June 2013, 
responding to Congress, the DOE shifted 
some money to sustain the C-Mod and cancel 
scheduled lay-offs (see Nature 498, 527-528; 
2013). The project, together with other phys- 
ics initiatives, also received modest, life-saving 
funding boosts as part of the $1.1-trillion 2014 
federal spending deal approved by Congress in 
January (see Nature http://doi.org/r73; 2014). 

Other big projects, such as the Alaska Vol- 
cano Observatory (AVO), have not fared so 
well (see Nature http://doi.org/r74; 2013). 
Funding for the University of Alaska Fairbanks’ 
part in the volcano-monitoring programme — 
a collaboration with the US Geological Survey 
in Reston, Virginia, and the Alaska Division of 
Geological and Geophysical Surveys in Fair- 
banks — shrank from $1 million to $513,000, 
according to Jeff Freymueller, coordinating sci- 
entist for the project. And that topped off years 
of dwindling research budgets, he says. “We're 
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on the verge of losing monitoring capabilities 
on a couple of volcanoes.” 

Losing real-time eruption data from the 
monitoring network has dealt a blow to the 
next generation of volcanologists, adds Frey- 
mueller, as well as threatening aviation safety. 

Since the sequester cuts, the university has 
not been able to replace the outgoing AVO 
graduate students who had been mining data 
from the roughly 200 seismic stations for 
insights into how volcanoes work. 


TRAINING TEST 

The full effects of sequestration on graduate 
students and postdocs around the country 
have been difficult to quantify. There are some 
regional indications, however: last October, 
a survey of 230 trainees at the University of 
California, San Francisco, showed that 71% 
of graduate students and 78% of postdocs 
thought that the current budget climate had 
decreased their desire to pursue research 
careers in academia. The university’s Science 
Policy Group, the campus organization that 
designed the survey, hopes to repeat the sur- 
vey in other universities. 

Career-development professionals at several 
institutions have noticed a few common pat- 
terns emerging among young scientists. For 
example, at the Jefferson Graduate School of 
Biomedical Sciences in Philadelphia, Pennsyl- 
vania, Lisa Kozlowski, associate dean for post- 
doctoral affairs and recruitment, is counselling 
more and more postdocs who have been given 
just a few months to get their affairs in order 
and leave their labs when their funding runs out. 

The turmoil of the past year should serve 
as a wake-up call 
for all trainees, says 
Kozlowski, even for 
those who escaped 
the worst effects of 
the sequestration. She 
says that too many 


graduate students and 

postdocs procrasti- 

nate on making long- 

term career plans, or 

1 ae simply overlook their 

re , importance. Advance 
If [can’t get planning, including 
a grant now, professional net- 
amIgoingtobe working and cultivat- 
struggling like _ ing leadership skills, 
this my whole are more important 
career?” than ever in an uncer- 
Heidi Fisher tain funding climate. 


“Start early, even 
when you think you're set in the lab and well- 
funded,’ she advises. 

Many career-development professionals 
say that bleak budgets are pushing trainees to 
look outside of academia. Sibby Anderson- 
Thompkins, director of postdoctoral affairs 
at the University of North Carolina at Chapel 
Hill, saw a particular shift last year in interest 


© 2014 Macmillan Publishers Limited. All rights reserved 


*% 


Yuntao Wu was unable to renew his NIH grant. 


-~s 


towards careers in science communication 
and science policy. “I think they realize with 
sequestration that decision-makers often 
don't understand research and the impact of 
research,’ she says. 

One of those mulling such a move is post- 
doc Melissa O'Connor of the Medical Univer- 
sity of South Carolina in Charleston. Facing 
tight budgets, her supervisor elected to spend 
the rest of his RO1 grant on collecting data to 
bolster a future grant application rather than 
applying to renew his funding. That means that 
support for O'Connor's work on immune-cell 
signalling will run out in May. 

O’Connor has dabbled in science advo- 
cacy, and in October 2012, she was selected 
for a one-year science-policy fellowship by 
the American Society for Pharmacology and 
Experimental Therapeutics in Bethesda. 
“Whether or not I get a career in that direc- 
tion,’ she says of her new-found interest, “it’s 
always going to be something I’m actively 
involved in.” 

Soon enough, O’Connor and other young 
scientist-advocates may find themselves 
working to deal with a fresh set of budget 
challenges. As Congress negotiates a federal 
budget for the 2015 fiscal year, the approach- 
ing elections will probably polarize dis- 
cussions along the same party divides that 
touched off last year’s chaos, notes Jennifer 
Zeitzer, director of legislative relations at the 
Federation of American Societies for Experi- 
mental Biology in Bethesda. And unless 
Congress can agree on an alternative plan, 
sequestration is set to restart in 2016. “It’s just 
a little bit of breathing room before the next 
round of budget problems,’ says Zeitzer. = 


Helen Shen is a freelance writer based in 
Mountain View, California. 
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TURNING POINT 


Andrew Zwicker 


Andrew Zwicker, head of science education at 
the Princeton Plasma Physics Laboratory in 
New Jersey, announced his candidacy on 

9 March as a Democratic congressman for the 
state. If elected, he'll be the fourth physicist 
ever to hold a federal legislative office. 


How did you end up making science 
education such a big part of your career? 
Two career-changing events happened when 
I was a postdoc. First, I met Rush Holt before 
he became US Representative Holt (Demo- 
crat, New Jersey). At the time, he was creating 
the Plasma Physics Laboratory's science- 
education programme, and recruited me as 
a mentor. Second, I mentored a student from 
a public high school in Trenton, New Jersey, 
who had amazing maturity and drive but was 
behind in her schoolwork. I spent hours help- 
ing her to write a computer program to ana- 
lyse scientific data. She got a full scholarship 
to university, and I learned how rewarding 
it is to mentor young scientists. Soon after, I 
was named as head of the science-education 
programme. I went from fusion researcher to 
science educator. Now I’m researching the use 
of electric glow discharges, such as those from 
fluorescent lamps, as teaching aids. 


How have your interests evolved? 

While leading the science-education pro- 
gramme, I was interested in ways to get people 
engaged in science. In 2006, I won first prize in 
an art-science competition run by the univer- 
sity for an image I took of a floating dust cloud 
of silica microspheres suspended in a plasma 
and illuminated by laser light (see go.nature. 
com/ckpmnl). But I am perhaps proudest of 
receiving an honourable mention for a video 
explaining what a flame is to an 11-year-old — 
an entry toa competition run by the Alan Alda 
Center for Communicating Science at Stony 
Brook University in New York. Scientists are 
stereotyped as being bad communicators, but I 
feel that we have a responsibility to communi- 
cate what we are doing with taxpayers’ money. 
If the public isn’t behind us, then appropriators 
in Congress will not see the value of our work. 


Have you always had political aspirations? 

No. I can’t say that running for Congress 
has been a lifelong ambition. When Rush 
announced that he wasn't going to run again, 
lots of people were caught off guard. Rush 
has a lot of respect in New Jersey. People 
started asking me if would consider running 
because my narrative is so similar to his — 
we are both plasma physicists involved with 


Princeton's science-education programme. 
When a Republican approached me, saying 
that the country needed people who use facts 
and evidence to make decisions rather than 
being motivated by other factors and ideolo- 
gies, I started to seriously consider running. 


What advice has Holt offered you? 

We have discussed the current state of affairs 
in Washington DC and the difficulties asso- 
ciated with gridlock. He told me that both 
Democrats and Republicans would turn to 
him or his staff on science issues. So being a 
scientist could help me to reach both parties. 


What persuaded you to run? 

The undergraduates on campus were so 
enthusiastic, and asked hard questions. They 
desperately want the government to address 
big issues such as energy and climate and to 
help to make Earth a better place. IknowI was 
talking to youthful, idealistic people, but it was 
overwhelming. I realized I had to do it. 


Does being a physicist help or hinder your 
chances? 

There are four candidates. I would not bea 
viable candidate in this district if 1 were nota 
scientist — three of my opponents are career 
politicians, have been in some level of politics 
for 20 years and have name recognition. One of 
my goals is to get younger people, who do not 
usually vote in mid-term elections, to the polls. 


What’s your campaign slogan? 

Rush is well known for a bumper sticker that 
reads: “My Congressman is a rocket scien- 
tist” If people vote for me, they can keep their 
bumper sticker and just add the word “still”. m 


INTERVIEW BY VIRGINIA GEWIN 


© 2014 Macmillan Publishers Limited. All rights reserved 


2 


WORK-LIFE BALANCE 
Burnout predictors 


Women are more likely than men to find 
that work interferes with their home life, 
and consequently are more likely to burn 
out, finds a study (V. Blom et al. J. Occup. 
Environ. Med. 56, 361-366; 2014). The 
authors surveyed 4,446 twins in Sweden 

to investigate the determinants of 
work-home and home-work interference. 
Although genetic effects were minimal, 
they found differences in stress levels 
between the sexes. Lead author Victoria 
Blum, a psychologist at the Karolinska 
Institute in Stockholm, says that women 
typically have more home responsibilities 
and thus a greater total workload than 
men. To avoid burning out, she says, 
researchers need to find ways to work from 
home when possible and to share home- 
based responsibilities more evenly. 


IMMIGRATION 


UK visa fast-track 


Research Councils UK (RCUK) is piloting 
ascheme to streamline the entry of 
researchers into Britain under the “Tier 1 
(Exceptional Talent)’ visa. It has teamed 
up with the Royal Society, the British 
Academy and the Royal Academy of 
Engineering to guarantee endorsement 
for researchers who are awarded specific 
RCUK fellowships and grants, a process 
that usually takes nine weeks. Rosie Beales, 
senior policy manager at the RCUK, 

says that the agency will assess the pilot 
scheme’s success in the next six months. 
Representatives of UK universities and 
scientific organizations have warned that 
complicated visa procedures have made 
Britain an unattractive destination for 
scholars (see Nature 506, 14-15; 2014). 


SOCIAL MEDIA 
Science 2.0 


Using social media more often would 
help scientists to disseminate their results, 
debate findings and engage a wider 
audience, says a study (C. Greenhow 

and B. Gleason Br, J. Educ. Technol. 45, 
392-402; 2014). Co-author Christine 
Greenhow at Michigan State University 
in East Lansing, says that researchers 
must learn to create a robust online 
presence. As part of an upcoming study, 
she polled 1,600 US and Canadian faculty 
members and found that 15% use Twitter, 
28% use YouTube and 39% use Facebook 
for scholarly activity. Greenhow is calling 
for social-media metrics to be added to 
the tenure process. 
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